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METHODS FOR THE PRODUCTION OF MULTIMERIC PROTEINS, 

AND RELATED COMPOSITIONS 

* * 

RELATED APPLICATIONS 

Benefit of priority under 35 U.S.C. §1 19(e) is claimed taU.S. provisional 
5 application Serial No. 60/302,885, filed July 5, 2001 , to van Rooijen, et aL, 
entitled "METHODS FOR THE PRODUCTION OF REDOX PROTEINS". This 
application is also a continuation-in-part of U.S. utility application Serial No. 
10/006,038, filed December 4, 2001 to van Rooijen, et aL, entitled "METHODS 
FOR THE PRODUCTION OF REDOX PROTEINS"; which is a continuation-in-part 
10 of U.S. utility application Serial No. 09/742,900, filed December 19, 2000 to 
Heifetz, etaL, entitled "METHOD OF PRODUCTION AND DELIVERY OF 
THIOREDOXIN". This application is also a continuation-in-part of U.S. utility 
application Serial No. 09/742,900. The subject matter of each of the provisional 
and utility applications is incorporated herein by reference in its entirety. 

1 5 Field Of The Invention 

The present invention relates to multimeric-protein-complexes, redox 

proteins, and recombinant polypeptides; and improved methods for their 

production. 
BACKGROUND 

20 Multimeric proteins (i.e. proteins comprising multiple polypeptide chains) 

are a biologically and commercially important class of proteins. Antibodies for 
example are multimeric proteins which are used to treat a wide range of disease 
conditions. However in view of their complexity, multimeric proteins frequently 
represent significant manufacturing challenges. 

25 Redox proteins are also a commercially important class of proteins with 

applications in a variety of different industries including the pharmaceutical, 
personal care and food industry. For example, the redox protein thioredoxin may 
be used in the manufacture of personal care products (Japanese Patent 
Applications JP9012471A2, JP103743A2, JP1 1 29785A2), pharmaceutical 

30 compositions/products (Aota et aL (1996) J. Cardiov. Pharmacol. (1996) 27: 

727-732) as well as to reduce protein allergens present in food products such as 
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milk (del Val et al. (1999) J. Ailerg. Vlin. Immunol. 103: 690-697) and wheat 
(Buchanan et al. (1997) Proc. Natl. Acad. Sci. USA 94: 5372-5377). 

However, there is a need in the art to further improve the methods for the 
recombinant expression of multimeric proteins, including redox proteins. The 
5 present invention satisfies this need and provides related advantages as well. 

SUMMARY OF THE INVENTION 

The present invention relates to novel and improved methods of 
producing a first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
10 heteromultimeric-fusion-proteins, immunoglobulin-polypeptide-chains, 

immunoglobulins, redox-fusion-polypeptides, and/or thioredoxin-related proteins; 
in association with oil bodies. Accordingly, provided herein are methods of 
producing a recombinant multimeric-protein-complex, said method comprising: 
(a) producing in a cell comprising oil bodies, a first recombinant polypeptide and 
15 a second recombinant polypeptide wherein said first recombinant polypeptide is 
capable of associating with said second recombinant polypeptide to form said 
multimeric-protein-complex; and (b) associating said multimeric-protein-complex 
with an oil body through an oil-body-targeting-protein capable of associating with 
said oil bodies and said first recombinant polypeptide. 
20 The method further contemplates isolating the oil bodies associated with 

said recombinant multimeric-protein-complex. The second recombinant 
polypeptide can be associated with a second oil-body-targeting-protein capable 
of associating with an oil body and said second recombinant polypeptide. Each 
of said oil-body-targeting-proteins can be an oil-body-protein or an 
25 immunoglobulin. The oil-body-targeting-protein can be an oleosin or caleosin. 
When the oil-body-targeting-protein can be an oleosin or caleosin, the first 
recombinant polypeptide can be fused to said oleosin or caleosin. Likewise, the 
second recombinant polypeptide can be fused to a second oleosin or second 
caleosin capable of associating with an oil body. The first and second 
30 recombinant polypeptides can be produced as a multimereic-fusion-protein 

comprising said first and second polypeptide, and can form a multimeric-protein- 
complex. The multimeric-protein-complex can be a heteromultimeric-protein- 

RECTIFIED SHEET (RULE 91) 



WO 02/50289 



PCT/US01/50240 



-3- 



complex, and the heteromummeric-protein-complex can be an enzymatically 
active redox complex or an immunoglobulin. In one embodiment, the first 
recombinant polypeptide is capable of associating with said second recombinant 
polypeptide in the cell. In another embodiment, the first recombinant 
5 polypeptide can be a thioredoxin and the second recombinant polypeptide can be 
a thioredoxin-reductase. In particular embodiments, the thioredoxin can be 
selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-194; and the thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 

10 NOs:195-313. In another embodiment, the first recombinant polypeptide can be 
an immunoglobulin-polypeptide-chain. For example, the first recombinant 
polypeptide can be an immunoglobulin light chain, or an immunologically active 
portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof. In 

15 this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 
or protein G. The cell can be a plant cell, such as a safflower cell, and the like. 

Also provided herein is a method of expressing a recombinant multimeric- 
protein-complex comprising a first and second recombinant polypeptide in a cell, 

said method comprising: 
20 (a) introducing into a cell a first chimeric nucleic acid sequence comprising: 

(i) a first nucleic acid sequence capable of regulating transcription 

in said cell operatively linked to; 

(H) a second nucleic acid sequence encoding a first recombinant 

polypeptide; 

25 (b) introducing into said cell a second chimeric nucleic acid sequence comprising: 

(i) a third nucleic acid sequence capable of regulating transcription 

in said cell operatively linked to; 

(ii) a fourth nucleic acid sequence encoding a second recombinant 

polypeptide; 

30 (c) growing said cell under conditions to permit expression of said first and 

second recombinant polypeptide in a progeny cell comprising oil bodies wherein 
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said first recombinant polypeptide and said second recombinant polypeptide are 
capable of forming a multimeric-protein-complex; and 

(d) associating said first recombinant polypeptide with an oil body through an oil- 
body-targeting-protein capable of associating with said oil bodies and said first 
5 recombinant polypeptide. This method further contemplates isolating from the 
progeny cell, oil bodies comprising the multimeric-protein-complex. The second 
recombinant polypeptide can be associated with a second oil-body-targeting- 
protein capable of associating with an oil body and second recombinant 
polypeptide. Each of said oil-body-targeting-proteins can be an oil-body-protein 
10 or an immunoglobulin. The oil-body-targeting-protein can be an oleosin or 

caleosin. When the oil-body-targeting-protein is an oleosin or caleosin, the first 
recombinant polypeptide can be fused to said oleosin or caleosin. Likewise, the 
second recombinant polypeptide can be fused to a second oleosin or second 
caleosin capable of associating with an oil body. The first and second 
15 recombinant polypeptides can be produced as a multimereic-fusion-protein 

comprising said first and second polypeptide, and can form a multimeric-protein- 
complex. The multimeric-protein-complex can be a heteromultimeric-protein- 
complex, and the heteromultimeric-protein-complex can be an enzymatically 
active redox complex or an immunoglobulin. In one embodiment, the first 
20 recombinant polypeptide and said second recombinant polypeptide are capable of 
forming a multimeric-protein-complex in said progeny cell. In another 
embodiment, the first recombinant polypeptide can be a thioredoxin and the 
second recombinant polypeptide can be a thioredoxin-reductase. In particular 
embodiments, the thioredoxin can be selected from the group consisting of SEQ 
25 ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-1 94; and the thioredoxin-reductase 
can be selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 
10, 40, 44, 48, 50 and SEQ ID NOs:195-313. In another embodiment, the first 
recombinant polypeptide can be an immunoglobulin-polypeptide-chain. For 
example, the first recombinant polypeptide can be an immunoglobulin light chain, 
30 or an immunologically active portion thereof, and the second recombinant 

polypeptide can be an immunoglobulin heavy chain, or an immunologically active 
portion thereof. In this embodiment, the oil-body-targeting-protein can comprise 
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protein A. protein L or protein G. The cell can be a plant cell, such as a 

saf flower cell, and the like. 

Also provided herein are methods of producing in a plant a recombinant 

multimeric-protein-complex, said method comprising: 
5 (a) preparing a first plant comprising cells, said cells comprising oil bodies and a 
first recombinant polypeptide wherein said first recombinant polypeptide is 
capable of associating with said oil bodies through an oil-body-targeting-protein; 
(b) preparing a second plant comprising cells, said cells comprising oil bodies and 
a second recombinant polypeptide; and 
10 {c) sexually crossing said first plant with said second plant to produce a progeny 
plant comprising cells, said cells comprising oil bodies, wherein said oil bodies 
are capable of associating with said first recombinant polypeptide, and said first 
recombinant recombinant polypeptide is capable of associating with said second 
recombinant polypeptide to form said recombinant multimeric-protein-complex. 
15 The second recombinant polypeptide can be associated with oil bodies through a 
second oil-body-targeting-protein in the second plant. The oil bodies can be 
isolated from the progeny plant comprising said multimeric-protein-complex. The 
oil-body-targeting-protein can be selected from an oil-body-protein or an 
immunoglobulin, wherein the oil-body-protein can be an oleosin or caleosin. The 
20 first recombinant polypeptide can be fused to the oleosin or caleosin; and the 
second recombinant polypeptide can be fused to a second oleosin or second 
caleosin capable of associating with an oil body. The first and second 
recombinant polypeptide can form a multimeric-protein-complex, such as a 
heteromultimeric-protein-complex, wherein the heteromultimeric-protein-complex 
25 can be an enzymatically active redox complex or an immunoglobulin. In a 

particular embodiment, the first recombinant polypeptide is a thioredoxin and the 
second recombinant polypeptide is a thioredoxin-reductase. The thioredoxin can 
be selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-194; and the thioredoxin-reductase can be selected from the group 
30 consisting of those set forth in SEQ ID NOs:8, 9, 1 0, 40, 44, 48, 50 and SEQ ID 
N0s:195-313. In another embodiment, the first recombinant polypeptide can be 
an immunoglobulin-polypeptide-chain. For example, the first recombinant 
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polypeptide can be an immunoglobulin light chain, or an immunologically active 

i 

portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof. In 
this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 
5 or protein G. The plant can be a safflower plant. 

Also provided herein are chimeric nucleic acids encoding a multimeric- 
fusion-protein as described herein, said nucleic acid comprising: 
(a) a first nucleic acid sequence encoding an oil-body-targeting-protein 
operatively linked in reading frame to; 
10 (b) a second nucleic acid sequence encoding a first recombinant polypeptide; 
linked in reading frame to; 

(c) a third nucleic acid sequence encoding a second recombinant polypeptide, 
wherein said first and second recombinant polypeptide are capable of forming a 
multimeric-protein-complex. The oil-body-targeting-protein can be selected from 

15 an oil-body-protein or an immunoglobulin. The oil-body-protein can be an oleosin 
or caleosin. The multimeric-protein-complex can be a heteromultimeric-protein- 
complex, and the first and second recombinant polypeptide can form an 
enzymatically active heteromultimeric redox complex or an immunoglobulin. In a 
particular embodiment, the first recombinant polypeptide is a thioredoxin and the 

20 second recombinant polypeptide is a thioredoxin-reductase. The thioredoxin can 
be selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs;52-194; and the thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 
NOs:195-313. In another embodiment, the first recombinant polypeptide can be 

25 an immunoglobulin-polypeptide-chain. For example, the first recombinant 

polypeptide can be an immunoglobulin light chain, or an immunologically active 
portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof. In 
this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 

30 or protein G. In yet another embodiment, positioned between the nucleic acid 
sequence encoding an oil-body-targeting-protein and the nucleic acid sequence 
encoding a first recombinant polypeptide can be a linker nucleic acid sequence 
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encoding an oil-body-surface-avoiding linker amino acid sequence. The oil-body- 
surface-avoiding linker amino acid sequence can be substantially negatively 
charged, or have a molecular weight of at least 35 kd. Optionally, the gene 
fusion further comprises a linker nucleic acid sequence encoding an amino acid 
5 sequence that is specifically cleavable by an enzyme or a chemical, wherein the 
linker sequence is positioned between the oil-body-surface-avoiding linker amino 
acid sequence that is also a non-proteolytic linker and said sequence encoding 
the first recombinant polypeptide. 

Also provided herein are recombinant multimeric-fusion-proteins 
10 comprising (i) an oil-body-targeting-protein, or fragment thereof, (ii) a first 

recombinant polypeptide and a (iii) second recombinant polypeptide, wherein 
said first and second recombinant polypeptides are capable of forming a 
multimeric-protein-complex. The oil-body-targeting-protein can be selected from 
an oil-body-protein or an immunoglobulin, and the oil-body-protein can be an 
15 oleosin or a caleosin. The multim eric-fusion-protein can be a heteromultimeric- 
fusion-protein, wherein said first and second recombinant polypeptide form an 
enzymatically active beteromultimeric redox complex or an immunoglobulin. In a 
particular embodiment, the first recombinant polypeptide is a thioredoxin and the 
second recombinant polypeptide is a thioredoxin-reductase. The thioredoxin can 
20 be selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-194; and the thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 
NOs:195-313. In another embodiment, the first recombinant polypeptide can be 
an immunoglobulin-polypeptide-chain. For example, the first recombinant 
25 polypeptide can be an immunoglobulin light chain, or an immunologically active 
portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof. In 
this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 
or protein G, In yet another embodiment, positioned between the nucleic acid 
30 sequence encoding an oil-body-targeting-protein and the nucleic acid sequence 
encoding a first recombinant polypeptide can be a linker nucleic acid sequence 
encoding an oil-body-surface-avoiding linker amino acid sequence. The oil-body- 

RECTIFIED SHEET (RULE 91) 



WO 02/50289 



PCT7US01/50240 



surface-avoiding linker amino acid sequence can be substantially negatively 
charged, or have a molecular weight of at least 35 kd. Optionally, the gene 
fusion further comprises a linker nucleic acid sequence encoding an amino acid 
sequence that is specifically cleavable by an enzyme or a chemical, wherein the 
5 linker sequence is positioned between the oil-body-surface-avoiding linker amino 
acid sequence and said sequence encoding the first recombinant polypeptide. 

Also provided herein are isolated oil bodies comprising a multimeric- 
protein-complex comprising (i) an oil-body-targeting-protein and (ii) a first 
recombinant polypeptide, said oil bodies further comprising a second 

10 recombinant polypeptide, wherein said first and second recombinant polypeptide 
are capable of forming a multimeric-protein-complex. The oil-body-targeting- 
protein can be selected from an oil-body-protein or an immunoglobulin, and the 
oil-body-protein can be an oleosin or a caleosin. The multimeric-fusion-protein 
can be a heteromultimeric-fusion-protein r wherein said first and second 

15 recombinant polypeptide form an enzymatically active heteromultimeric redox 
complex or an immunoglobulin. In a particular embodiment, the first 
recombinant polypeptide is a thioredoxin and the second recombinant 
polypeptide is a thioredoxin-reductase. In another embodiment, the first 
recombinant polypeptide can be an immunoglobulin-polypeptide-chain. For 

20 example, the first recombinant polypeptide can be an immunoglobulin light chain, 
or an immunologically active portion thereof, and the second recombinant 
polypeptide can be an immunoglobulin heavy chain, or an immunologically active 
portion thereof. In this embodiment, the oil-body-targeting-protein can comprise 
protein A, protein L or protein G. 

25 Also provided herein are isolated oil bodies comprising 

(a) a first fusion protein comprising a first oil-body-targeting-protein fused to a 
first recombinant polypeptide; and 

(b) a second fusion protein comprising a second oil-body-targeting-protein fused 
to a second recombinant polypeptide, 

30 wherein said first and second recombinant polypeptide are capable of forming a 
multimeric-protein-complex. The oil-body-targeting-protein can be selected from 
an oil-body-protein or an immunoglobulin, and the oil-body-protein can be an 
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oleosin or a caleostn. The multimeric-fusion-protein can be a heteromultimeric- 
fusion-protein, wherein said first and second recombinant polypeptide form an 
enzymatically active heteromultimeric redox complex or an immunoglobulin. In a 
particular embodiment, the first recombinant polypeptide is a thioredoxin and the 
5 second recombinant polypeptide is a thioredoxin-reductase. The thioredoxin can 
be selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-194; and the thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID 
NOs:1 95-313. in another embodiment, the first recombinant polypeptide can be 
10 an immunoglobulin-polypeptide-chain. For example, the first recombinant 

polypeptide can be an immunoglobulin light chain, or an immunologically active 
portion thereof, and the second recombinant polypeptide can be an 
immunoglobulin heavy chain, or an immunologically active portion thereof. In 
this embodiment, the oil-body-targeting-protein can comprise protein A, protein L 

15 or protein G. 

Also provided are cells and transgenic plants comprising oil bodies, 
multimeric-protein-cornplexes, and multimeric-fusion-proteins, set forth herein. 
In one embodiment, the first recombinant polypeptide can be an immunoglobulin- 
polypeptide-chain. For example, the first recombinant polypeptide can be an 
20 immunoglobulin light chain, or an immunologically active portion thereof, and the 
second recombinant polypeptide can be an immunoglobulin heavy chain, or an 
immunologically active portion thereof. In this embodiment, the oil-body- 
targeting-protein can comprise protein A, protein L or protein G. In 
embodiments, wherein said first recombinant polypeptide is a thioredoxin and 
25 said second recombinant polypeptide is a thioredoxin-reductase, the methods 
described herein can be used to formulate the oil bodies for use in the 
preparation of a food product, personal care product or pharmaceutical 
composition. These formulations can further comprise the addition of NADP or 
NADPH. The food product can be a milk or wheat based food product. The 
30 personal care product can reduce the oxidative stress to the surface area of the 
human body or can be used to lighten the skin. The pharmaceutical composition 
can be used to treat chronic obstructive pulmonary disease (COPD), cataracts, 
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diabetes, envenomation, bronchiopulmonary disease, malignancies, psoriasis, 
reperfusion injury, wound healing, sepsis, Gl bleeding, intestinal bowel disease 
(IBD), ulcers, GERD (gastro esophageal reflux disease). 

Also provided herein are compositions comprising isolated oil bodies, 
5 thioredoxin and thioredoxin-reductase, wherein said thioredoxin can be selected 
from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID N0s:52- 
1 94, and said thioredoxin-reductase can be selected from the group consisting 
of those set forth in SEQ ID NOs:8 f 9, 10, 40, 44, 48, 50 and SEQ ID NOs:195- 
313. The composition can further comprise NADP or NADPH. In another 
10 embodiment, the composition comprises a first recombinant polypeptide that can 
be an immunoglobuiin-polypeptide-chain and a second recombinant polypeptide. 
For example, the first recombinant polypeptide can be an immunoglobulin light 
chain, or an immunologically active portion thereof, and the second recombinant 
polypeptide can be an immunoglobulin heavy chain, or an immunologically active 
15 portion thereof. In this embodiment, the oil-body-targeting-protein can comprise 

protein A, protein L or protein G. 

Also provided are multimeric-fusion-proteins, wherein the fusion-protein 
contains two or more polypeptide chains selected from the group of proteins set 
forth in Figure 5. Methods are also provided of reducing allergenicity of a food 

20 comprising the steps of providing the isolated oil bodies set forth herein; and 
adding the isolated oil bodies to the food, whereby allergenicity of the food is 
reduced. The food can be selected from the group consisting of wheat flour, 
wheat dough, milk, cheese, yogurt and ice cream. The various methods of 
treating food can further comprise providing NADH as a co-factor in the 

25 substantial absence of NADPH. 

Also provided herein are methods of treating or protecting a target 
against oxidative stress, comprising the steps of providing the recombinant 
redox fusion polypeptide comprising thioredoxin and thioredoxin-reductase; and 
contacting the recombinant fusion polypeptide with a target, wherein the target 

30 is susceptible to oxidative stress, thereby treating or protecting against the 

stress. The target can be selected from the group consisting of a molecule, a 
molecular complex, a cell, a tissue, and an organ. 
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Also provided herein are methods for preparing an enzymatically active 
redox protein associated with oil bodies comprising: 

a) producing in a cell a redox fusion polypeptide comprising a first 
redox protein linked to a second redox protein; 
5 b) associating said redox fusion polypeptide with oil bodies through 

an oil-body-targeting-protein capable of associating with said redox fusion 
polypeptide and said oil bodies; and 

c) isolating said oil bodies associated with said redox fusion 
polypeptide. The first redox protein can be a thioredoxin and the second redox 
10 protein can be a thioredoxin-reductase. 

Also, provided herein are methods of producing an immunoglobulin, said 
method comprising: (a) producing in a ceil comprising oil bodies, a first 
immunoglobulin-polypeptide-chain and a second immunoglobulin-polypeptide- 
chain wherein said first immunogiobulin-polypeptide-chain is capable of 
15 associating with said second immunoglobulin-polypeptide-chain to form said 
immunoglobulin; and (b) associating said immunoglobulin with an oil body 
through an oil-body-targeting-protein capable of associating with said oil bodies 
and said first immunoglobulin-polypeptide-chain. For example, the first 
immunoglobulin-polypeptide-chain can be an immunoglobulin light chain, or an 
20 immunologically active portion thereof, and the second immunoglobuiin- 

polypeptide-chain can be an immunoglobulin heavy chain, or an immunologically 
active portion thereof. In this embodiment, the oil-body-targeting-protein can 
comprise protein A, protein L or protein G. 

Also provided herein are methods for preparing a redox protein or an 
25 immunoglobulin associated with oil bodies comprising: 

a) introducing into a cell a chimeric nucleic acid sequence comprising: 

1) a first nucleic acid sequence capable of regulating 
transcription in said cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant 
30 fusion polypeptide comprising (i) a nucleic acid sequence encoding 

a sufficient portion of an oil-body-protein to provide targeting of 
said recombinant fusion polypeptide to an oil body linked to (ii) a 
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nucleic acid sequence encoding a redox fusion polypeptide 
comprising a first redox protein linked to a second redox protein, 
or a nucleic acid sequence encoding a immunoglobulin comprising 
a first immunoglobulin-polypeptide-chain linked to a second 
immunoglobulin-polypeptide-chain, operatively linked to; 
3) a third nucleic acid sequence capable of terminating 
transcription in said cell; 

growing said cell under conditions to permit expression of said 
redox fusion polypeptide or immunoglobulin in a progeny cell comprising oil 
10 bodies; and 

c) isolating from said progeny cell said oil bodies comprising said 
redox fusion polypeptide or immunoglobulin. In certain embodiments, positioned 
between said nucleic acid sequence encoding a sufficient portion of an oil-body- 
protein and said nucleic acid sequence encoding a redox fusion polypeptide or 

15 immunoglobulin can be a linker nucleic acid sequence encoding an oil-body- 
surface-avoiding linker amino acid sequence. The oil-body-surface-avoiding 
linker amino acid sequence can be substantially negatively charged or have a 
molecular weight of at least 35 kd. Optionally, the gene fusion further 
comprises a linker nucleic acid sequence encoding an amino acid sequence that 

20 is specifically cleavable by an enzyme or a chemical, wherein the linker sequence 
is positioned between the oil-body-surface-avoiding linker amino acid sequence 
and said nucleic acid sequence encoding a redox fusion polypeptide. In this 
optional embodiment, also contemplated is the introduction of an enzyme or 
chemical that cleaves said redox fusion polypeptide from said oil body, thereby 

25 obtaining isolated redox fusion polypeptide. The first redox protein can be a 
thioredoxin and said second redox protein can be a thioredoxin-reductase. In 
one embodiment, the thioredoxin and thioredoxin-reductase can be obtained 
from Arabidops/s. In another embodiment, the first redox protein is at least 5 
times more active when produced as a redox fusion polypeptide as compared to 

30 the production of the first redox protein without the second redox protein. 

Also provided herein, for use with the various methods set forth herein is 
the formulation of an emulsion of the oil bodies associated with the redox fusion 
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polypeptide for use in the preparation of a product capable of treating oxidative 
stress in a target, a product capable of chemically reducing a target, 
pharmaceutical composition, a personal care product or a food product. 
Accordingly, an emulsion formulation composition is provided. 
5 Also provided herein is a chimeric nucleic acid comprising: 

1) a first nucleic acid sequence capable of regulating transcription in 

a host cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant fusion 
polypeptide comprising (i) a nucleic acid sequence encoding a sufficient portion 

10 of an oil-body-protein to provide targeting of said recombinant fusion polypeptide 
to an oil body linked to (ii) a nucleic acid sequence encoding a redox fusion 
polypeptide comprising a first redox protein linked to a second redox protein 

operatively linked to; 

3) a third nucleic acid sequence capable of terminating transcription 
15 in said cell. The oil-body-protein can be an oieosin or a caleosin, the first redox 

protein can be a thioredoxin and said second redox protein can be a thioredoxin- 
reductase. In certain embodiments, positioned between said nucleic acid 
sequence encoding a sufficient portion of an oil-body-protein and said nucleic 
acid sequence encoding a redox fusion polypeptide is a linker nucleic acid 

20 sequence encoding an oil-body-surface-avoiding linker amino acid sequence. 
The oil-body-surface-avoiding linker amino acid sequence can be substantially 
negatively charged, or have a molecular weight of at least 35 kd. In one 
embodiment, the gene fusion optionally further comprises a linker nucleic acid 
sequence encoding an amino acid sequence that is specifically cleavable by an 

25 enzyme or a chemical, wherein the linker sequence is positioned between the oil- 
body-surface-avoiding linker amino acid sequence and said nucleic acid sequence 
encoding a redox fusion polypeptide. 

Also provided herein are transgenic plants, e.g., safflower plants, 
comprising any of the chimeric nucleic acid sequences and constructs described 

30 herein. The chimeric nucleic acids can be contained within a plastid. 

Accordingly, isolated plastids are provided having chimeric nucleic acids therein. 
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Also provided are plant seeds comprising the chimeric nucleic acids provided 
herein. 

Also provided are oil body preparations obtained using any of the 
methods provided herein, and food products, pharmaceutical compositions, and 
5 personal care products containing the oil body preparations. The products 

and/or compositions provided herein are capable of treating oxidative stress in a 
target, capable of chemically reducing a target. Also provided is a detergent 
composition comprising an oil body preparation capable of chemically reducing a 
target, and related methods of cleansing an item, comprising administering such 

10 product to the item under conditions that promote cleansing. 

Also provided herein are nucleic acid constructs comprising a gene fusion, 
wherein the gene fusion comprises a first region encoding an oil-body-protein or 
an active fragment thereof, operably linked to a second region encoding at least 
one thioredoxin-reiated protein or an active fragment thereof. In one 

15 embodiment, the at least one thioredoxin-reiated protein can be thioredoxin. The 
thioredoxin can be selected from the group consisting of SEQ ID MOs:38, 42, 
46, 50 and SEQ ID NOs:52-194. The thioredoxin can be obtained from 

Arabidopsis or wheat. 

In another embodiment, the at least one thioredoxin-reiated protein can be 

20 thioredoxin-reductase. The thioredoxin-reductase can be selected from the group 
consisting of those set forth in SEQ ID NOs:8, 9,10, 40, 44, 48, 50 and SEQ ID 
NOs:195-313 and/or derived from Arabidopsis or wheat. The thioredoxin- 
reductase can be an NADPH-dependent thioredoxin-reductase.. The second 
region can encode a thioredoxin and thioredoxin-reductase. In one embodiment, 

25 the thioredoxin . and thioredoxin-reductase is obtained from Mycobacterium 

leprae. In another embodiment, the at least one thioredoxin-reiated protein can 
be an engineered fusion protein. The first region can precede, in a 5' to 3' 
direction, the second region. Alternatively, the first region follows, in a 5' to 3' 
direction, the second region. The gene fusion can optionally further comprise a 

30 third region encoding a second thioredoxin-reiated protein or an active fragment 
thereof, operably linked to the first region, or to the second region, or to both. A 
seed-specific promoter, such as a phaseolin promoter, can be operably linked to 
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the gene fusion. In one embodiment, at least one thioredoxin-related protein is 
derived from a plant species selected from the group consisting of Arabidopsis 
and wheat. In another embodiment, at least one thioredoxin-related protein can 

be derived from E. colL 
5 In one embodiment, the gene fusion further comprises a nucleic acid 

sequence encoding an oil-body-surface-avoiding linker amino acid sequence, 
wherein the linker amino acid sequence is positioned between the first region 
and the second region. The oil-body-surface-avoiding linker amino acid sequence 
can be substantially negatively charged, or have a molecular weight of at least 

10 35 kd. In addition, the gene fusion can further comprise a linker nucleic acid 
sequence encoding an amino acid sequence that is specifically cleavable by an 
enzyme or a chemical, wherein the linker sequence is positioned between the oil- 
body-surface-avoiding linker amino acid sequence and the second region. 
Also provided herein are transgenic plants containing a nucleic acid 

15 construct comprising a gene fusion, wherein the gene fusion comprises a region 
encoding an oil-body-protein or an active fragment thereof, operably linked to a 
region encoding a first thioredoxin-related protein or an active fragment thereof. 
The thioredoxin-related protein can be thioredoxin. The nucleic acid construct 
can be contained within a plastid. In one embodiment, when the first 

20 thioredoxin-related protein is thioredoxin and the construct can further comprise 
a region encoding a thioredoxin-reduetase. The gene fusion can optionally 
further comprise a third region encoding a second thioredoxin-related protein or 
an active fragment thereof, operably linked to the first region, or to the second 
region, or to both. The gene fusion can optionally further comprise a nucleic 

25 acid sequence encoding an oil-body-surface-avoiding linker amino acid sequence, 
wherein the nucleic acid encoding the linker amino acid sequence is positioned 
between the region encoding an oil-body-protein and the region encoding a first 
thioredoxin-related protein. The oil-body-surface-avoiding linker amino acid 
sequence can be substantially negatively charged, or have a molecular weight of 

30 at least 35 kd. The gene fusion can optionally further comprise a linker nucleic 
acid sequence encoding an amino acid sequence that is specifically cleavable by 
an enzyme or a chemical, wherein the linker sequence is 
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positioned between the oil-body-surface-avoiding linker amino acid sequence and 
the region encoding a first thioredoxin-related protein. 

Also provided is a transgenic plant comprising a nucleic acid construct, a 
seed-specific promoter operably linked to a gene fusion, wherein the gene fusion 
5 comprises a region encoding an oil-body-protein or an active fragment thereof, 
operably linked to a region encoding a first thioredoxin-related protein or an 
active fragment thereof, wherein a fusion protein comprising activities of oleosin 
and the thioredoxin-related protein is produced in a seed of the plant. In another 
embodiment, a thioredoxin-related protein having concentration of at least about 

10 0.5% of total cellular seed protein is provided. Also provided herein is an 

extract comprising an activity of a thioredoxin-related protein. Also provided are 
oil bodies and/or oil obtained from various seeds. 

Also provided herein are methods of making a fusion protein comprising a 
thioredoxin-related activity, the method comprising the steps of: 

15 a ) providing a transgenic plant comprising a nucleic acid construct 

comprising a seed-specific promoter operably linked to a gene fusion, wherein 
the gene fusion comprises a region encoding an oil-body-protein or an active 
fragment thereof, operably linked to a region encoding a first thioredoxin-related 
protein or an active fragment thereof, the gene fusion encoding a fusion protein 

20 comprising a thioredoxin-related activity; 

b) obtaining seeds from the plant; and 

c) recovering the fusion protein by isolating oil bodies from the seeds. In 
one embodiment, the oil bodies are fractionated to achieve partial purification of 
the fusion protein. The oil bodies can be in association with a fusion protein. 

25 The oil-body-protein can be cleaved from the thioredoxin-related protein after 

fractionation of the oil bodies. The cleaving step can make use of a protease or 
chemical proteolysis. 

Also provided herein are methods of reducing allergenicity of a food 

comprising the steps of: 
30 a) providing a preparation comprising oil bodies associated with a fusion 

protein, the fusion protein comprising an oil-body-protein or an active fragment 
thereof and a thioredoxin-related protein or an active fragment thereof; and 
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b) adding the preparation to the food, whereby allergenicity of the food is 
reduced due to activity of the thioredoxin-related protein or fragment. The food 
can be wheat flour, wheat dough, milk, cheese, yogurt and ice cream. In one 
embodiment, NADH is used as a co-factor in the substantial absence of NADPH. 
5 Also provided herein are pharmaceutical compositions comprising a fusion 

protein, the fusion protein comprising an oil-body-protein or an active fragment 
thereof and a thioredoxin-related protein or an active fragment thereof, in a 
pharmaceutical^ acceptable carrier. The oil bodies can be associated with the 
fusion protein. Also provided is a cosmetic formulation comprising oil bodies 

10 associated with a fusion protein, the fusion protein comprising an oil-body- 
protein or an active fragment thereof and a thioredoxin-related protein or an 
active fragment thereof, in a pharmaceutical^ acceptable carrier. Also provided 
are methods of treating or protecting a target against oxidative stress, 
comprising the steps of: 

15 a) providing a preparation comprising a fusion protein, the fusion protein 

comprising an oil-body-protein or an active fragment thereof and a thioredoxin- 
related protein or an active fragment thereof; and 

b) contacting the preparation with a target, wherein the target is 
susceptible to oxidative stress, thereby treating or protecting against the stress. 

20 The target can be selected from the group consisting of a molecule, a molecular 
complex, a cell, a tissue, and an organ. 

Also provided is a nucleic acid construct comprising a gene fusion, 
wherein the gene fusion comprises a first region encoding an oil-body-protein or 
an active fragment thereof, operably linked to a second region encoding at least 

25 one polypeptide or an active fragment thereof, and an oil-body-surface-avoiding 
linker in frame between the first and second region polypeptides. Also provided 
are methods of expressing this construct into the encoded amino acid sequence; 
and oil bodies, formulations, emulsions, cells, and plants comprising the 
construct and encoded amino acid sequence. These particular constructs, oil 

30 bodies, formulations, emulsions, cells, and plants can be produced according to 
the methods described herein. The second region can encode any polypeptide, 
for example, a therapeutically, nutritionally, industrially or cosmetically useful 
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peptide as set forth herein. For example, the second region can encode a redox 
protein, an immunoglobulin, a thioredoxin-related protein or any one or more 
recombinant polypeptides of a multimeric-protein-complex. 

Other features and advantages of the present invention will become 
5 readily apparent from the following detailed description. It should be understood 
however that the detailed description and the specific examples while indicating 
particular embodiments of the invention are given by way of illustration only. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 Figure 1 shows a ClustalW Formatted Alignment comparison of the 

published NADPH thioredoxin-reductase nucleic acid sequence (SEQ ID NO:9) 
(ATTHIREDB-Jacquot et al. J. Mol. Biol. (1994) 235 (4): 1 357-63.) with the 
sequence isolated herein in Example 1 (TR; SEQ ID NO:8). 

Figure 2 shows a ClustalW Formatted Alignment comparison of the 

15 deduced amino acid sequence of the published NADPH thioredoxin-reductase 
sequence (SEQ ID NO:1 2)(ATTHlREDB Jacquot et al. J. Mol. Biol. (1994) 235 
(4):1 357-63.) with the sequence isolated herein in Example 1 (TR; SEQ ID 
NO:13). 

Figure 3 shows a clustal alignment comparing the amino acid sequence of 
20 the Arabidopsis thaliana thioredoxin-reductase-linker-thioredoxin synthetic fusion 
(Arab TR-link-Trxh; SEQ ID NO:37) to the Mycobacterium leprae thioredoxin- 
reductase-thioredoxin natural fusion (M.lep TR/Trxh; SEQ ID NO:36) natural 
fusion. Overall, the proteins are approximately 50% identical at the amino acid 
level. 

25 Figure 4 is a bar graph showing the thioredoxin/thioredoxin-reductase 

activity measurements for the various transgenic Arabidopsis seed fractions. 
Relative specific activity is expressed as a percentage of the E. coii thioredoxin 
and thioredoxin-reductase activities. The numbered bars in the graph correspond 
to the following: 

30 1. W.T. + oleosin-thioredoxin 

2. W.T. + thioredoxin-oleosin 

3. W.T. + thioredoxin 
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4. W.T. + oleosin-thioredoxin-reductase 

5. W.T. + thioredoxin-reductase-oleosin 

6. W.T. + thioredoxin-reductase 

7. thioredoxin + oleosin-thioredoxin-reductase 
5 8. thioredoxin + thioredoxin-reductase-oleosin 

9. thioredoxin + thioredoxin-reductase 

10. thioredoxin-reductase + oleosin-thioredoxin 

1 1 . thioredoxin-reductase + thioredoxin-oleosin 

12. oleosin-M./ep TR/Trxh 

10 13. E. coli thioredoxin-reductase -f thioredoxin 

Figure 5 provides a listing of exemplary proteins for use in the 
heteromultimeric-fusion-proteins and heteromultimeric-protein-complexes 

provided herein. 
DETAILED DESCRIPTION 

15 As hereinbefore mentioned, the present invention relates to novel and 

improved methods for the production of multimeric proteins, including a first and 
second recombinant polypeptide, multimeric-protein-complexes, 
heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulin-polypeptide-chains, 

20 immunoglobulins, redox-fusion-polypeptides, and a first and second thioredoxin- 
related protein; and related products. These methods permit the production of 
active multimeric-protein-complexes in association with oil bodies. The oil 
bodies in association with the multimeric-protein-complex may be used to 
prepare various useful emulsions. 

25 Accordingly, provided herein are methods of producing a recombinant 

multimeric-protein-complex associated with an oil body, said method comprising: 

(a) producing in a cell comprising oil bodies, a first recombinant 
polypeptide and a second recombinant polypeptide wherein said first 
recombinant polypeptide is capable of associating with said second recombinant 

30 polypeptide in the cell to form said multimeric-protein-complex; and 
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(b) associating said multimeric-protein-complex with an oil body through 
an oil-body-targeting-protein capable of associating with said oil body and said 
first recombinant polypeptide. 

Definitions and terms 

5 Unless defined otherwise, all technical and scientific terms used herein 

have the same meaning as is commonly understood by one of skill in the art to 
which this invention belongs. Where permitted, all patents, applications, 
published applications and other publications and sequences from GenBank, 
SwissPro and other data bases referred to throughout in the disclosure herein are 
10 incorporated by reference in their entirety. 

As used herein, the phrase "multimeric-protein-complex", refers to two or 
more polypeptide chains that permanently or repeatedly interact or permanently 
or repeatedly coordinate to form a biologically active assembly comprising said 
two or more polypeptide chains. It should be noted that the polypeptides may 
15 be independently biologically active without interaction or coordination to form 
the complex. The multimeric-protein-complex may provide a biological structure, 
or it may be capable of facilitating a chemical or biological reaction. For 
example, one of the protein regions within the multimeric-protein-complex can 
repeatedly activate or repeatedly inactivate the biological or metabolic activity of 
20 one or more of the other proteins contained within the multimeric-protein- 
complex. In one embodiment, the first and second recombinant polypeptide 
contained in a multimeric-protein-complex may either associate or interact as 
independent non-contiguous polypeptide chains or the multimeric-protein- 
complex may be prepared as a fusion polypeptide (multimeric-fusion-protein) 
25 between the first and second recombinant polypeptide. 

One example of a repeated (e.g., reoccurring) interaction or association 
between the two or more polypeptides of a multimeric-protein-complex provided 
herein is the interaction between two or more non-identical redox proteins to 
form a heteromultimeric-protein-complex. Exemplary redox proteins for use in 
30 this regard are thioredoxin and the thioredoxin-reductase. A further example is 
the interaction between two or more immunoglobulin-polypeptide-chains to form 
an immunoglobulin. As used herein, the phrase "heteromultimeric-protein- 
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complex", refers to two or more non-identical polypeptide chains that 
permanently or repeatedly interact or permanently or repeatedly coordinate to 
form a biologically active assembly comprising said two or more polypeptide 
chains. Other examples of multimeric-protein-complexes provided herein include 
5 a first and second recombinant polypeptide, heteromultimeric-protein-complexes, 
multimeric-fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 
first and second immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
and a first and second thioredoxin-related protein. 

The recombinant polypeptide or multimeric-protein-complex is associated 
10 with an oil body. As used herein, the phrase "oil body" or "oil bodies" refers to 
any oil or fat storage organelle in any cell type. Accordingly, the oil bodies may 
be obtained from any cell comprising oil bodies, including plant cells (described 
in for example: Huang (1992) Ann. Rev. Plant Mol. Biol. 43: 177-200), animal 
cells {described in for example: Murphy (1990) Prog Lipid Res 29(4): 299-324), 
15 including adipocytes, hepatocytes, steroidogenic cells, mammary epithelial cells, 
macrophages, algae cells (described in for example: Rossler (1988) J. Physiol. 
London, 24: 394-400) fungal cells, including yeast cells (described in for 
example Leber et al. (1994) Yeast 10: 1421-1428) and bacterial cells (described 
in for example: Pieper-Furst et al. (1994) J. Bacteriol. 176: 4328-4337). 
20 Preferably the oil bodies used herein are oil bodies obtainable from plant cells 
and more preferably the oil bodies obtainable from plant seed cells. 

As used herein, the phrase "is capable of associating with", "associate" 
or grammatical variations thereof, refers to any interaction between two or more 
polypeptides, including any covalent interactions (e.g. muftimeric-fusion-proteins) 
25 as well as non-covalent interactions. Exemplary non-covalent interactions can 
be between the oil-body-targeting-protein and a redox protein or immunoglobulin- 
polypeptide-chain, as well as between two or more different proteins contained 
within two or more separate oil-body-protein fusion proteins (e.g., the redox 
proteins in oleosin-thioredoxin and oleosin-thioredoxin-reductase). 
30 As used herein, the term "recombinant" (also referred to as heterologous) 

in the context of recombinant proteins and amino acids, means "of different 
natural origin" or represents a non-natural state. For example, if a host cell is 
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transformed with a nucleotide sequence derived from another organism, 
particularly from another species, that nucleotide sequence and amino acid 
sequence, encoded thereby, is recombinant (heterologous) with respect to that 
host cell and also with respect to descendants of the host ceil which carry that 
5 gene. Similarly, recombinant (or heterologous) refers to a nucleotide sequence 
derived from and inserted into the same natural, original cell type, but which is 
present in a non-natural state, e.g., a different copy number, or under the control 
of different regulatory elements. A transforming nucleotide sequence may 
include a recombinant coding sequence, or recombinant regulatory elements. 
10 Alternatively, the transforming nucleotide sequence may be completely 

heterologous or may include any possible combination of heterologous and 
endogenous nucleic acid sequences. 

In various embodiments of the present invention, the first and/or second 
recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 
1 5 protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
and/or thioredoxin-related proteins, are produced in a cell comprising oil bodies. 
As used herein the phrase "in a cell", "in the cell", or grammatical variations 
thereof, mean that the first and/or second recombinant polypeptides, multimeric- 
20 protein-complexes, heteromultimeric-protein-complexes, multimeric-fusion- 
proteins, heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin- 
polypeptide-chains, redox-fusion-polypeptides, and/or thioredoxin-related 
proteins, may be produced in any cellular compartment of that cell, so long as 
that cell comprises oil bodies therein. In embodiments of the invention in which 
25 plant cells are used, the phrase is intended to include the plant apoplast. 

In various embodiments provided herein, the first and/or second 
recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 
protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
30 and thioredoxin-related proteins, associate with an oil body through an oil-body- 
targeting-protein. As used herein, the phrase "oil-body-targeting-protein" refers 
to any protein, protein fragment or peptide capable of associating with an oil 
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body. Exemplary oil-body-targeting-proteins for use herein include oil-body- 
proteins, such as oleosin and caleosin; immunoglobulins, such as bi-specific 

antibodies; and the like. 

In embodiments described herein in which an oil-body-protein is used, the 
5 first and/or second recombinant polypeptides, multimeric-protein-complexes, 
heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and thioredoxin-related proteins, are 
preferably fused to the oil-body-protein. The term "oil-body-protein" refers to 
10 any protein naturally present in cells and having the capability of association 
with oil bodies, including any oleosin or caleosin. 

Accordingly, provided herein a method of expressing a recombinant 
multimeric-protein-complex comprising a first and second recombinant 
polypeptide in a cell, said method comprising: 
15 (a) introducing into a cell a first chimeric nucleic acid sequence comprising: 

(i) a first nucleic acid sequence capable of regulating transcription in said cell 
operatively linked to; 

(ii) a second nucleic acid sequence encoding a first recombinant polypeptide, 
such as a redox protein, an immunoglobulin-polypeptide-chain or an thioredoxin- 

20 related protein, fused to an oil-body-protein; 

(b) introducing into said cell a second chimeric nucleic acid sequence comprising: 

(i) a third nucleic acid sequence capable of regulating transcription in said cell 
operatively linked to; 

(ii) a fourth nucleic acid sequence encoding a second recombinant polypeptide, 
25 such as a second redox protein, a second immunoglobulin-polypeptide-chain or a 

second thioredoxin-related protein,; 

(c) growing said cell under conditions to permit expression of said first and 
second recombinant polypeptide in a progeny cell comprising oil bodies wherein 
said first recombinant polypeptide and said second recombinant polypeptide are 

30 capable of forming a multimeric-protein-complex, preferably in said progeny cell; 
and 
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(d) associating said first recombinant polypeptide with an oil body through said 
oil-body-protein. 

The term "nucleic acid" as used herein refers to a sequence of nucleotide 
or nucleoside monomers consisting of naturally occurring bases, sugars and 
5 intersugar (backbone) linkages. The term also includes modified or substituted 
sequences comprising non-naturally occurring monomers or portions thereof, 
which function similarly. The nucleic acid sequences may be ribonucleic acids 
(RNA) or deoxyribonucleic acids (DNA) and may contain naturally occurring 
bases including adenine, guanine, cytosine, thymidine and uracil. The sequences 
10 also may contain modified bases such as xanthine, hypoxanthine, 2- 

aminoadenine, 6-methyl, 2-propyl and other alkyl adenines, 5-halo-uracil. 5-halo 
cytosine, 6-aza uracil, 6-aza cytosine and 6-aza thymine, pseudo uracil, 4- 
thiouracil, 8-halo adenine, 8-amino adenine, 8-thiol-adenine, 8-thio-aIkyl 
adenines, 8-hydroxyl adenine and other 8-substituted adenines, 8-halo guanines, 
15 8 amino guanine, 8 thiol guanine, 8-thioalkyl guanines, 8 hydroxyl guanine and 
other 8-substituted guanines, other aza and deaza uracils, thymidines, cytosines, 
adenines, or guanines, 5-trifluoromethyl uracil and 5-trifluoro cytosine. 
Multimeric-protein-complexes 

In accordance with the methods and compositions provided herein, any 
20 two recombinant polypeptides capable of forming a multimeric-protein-complex 
may be used. The nucleic acid sequences encoding the two recombinant 
polypeptides may be obtained from any biological source or may be prepared 
synthetically. In general nucleic acid sequence encoding multimeric proteins are 
known to the art and readily available. Known nucleic acid sequences encoding 
25 multimeric-protein-complexes may be used to design and construct nucleic acid 
sequence based probes in order to uncover and identify previously undiscovered 
nucleic acid sequences encoding multimeric-protein-complexes, for example, by 
screening cDNA or genomic libraries or using 2- or multi-hybrid systems. Thus, 
additional nucleic acid sequences encoding multimeric-protein-complexes may be 
30 discovered and used as described herein. 

The first and/or second recombinant polypeptides that are comprised 
within a multimeric-protein-complex provided herein, can themselves be in the 
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form of heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and/or a first and/or second thioredoxin- 
related protein. 

5 The nucleic acid sequence encoding the first and second recombinant 

polypeptide, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and/or a first and/or second thioredoxin- 
related protein may be obtained from separate sources or may be obtained from 
10 the same source. In general however, such nucleic acid sequence is obtained 
from the same or a similar biological source. In certain embodiments wherein 
the nucleic acid sequence encoding the first and second recombinant polypeptide 
protein are obtained from the same source, the nucleic acid sequence encoding 
the first recombinant polypeptide and second recombinant polypeptide may be 

15 naturally fused. In accordance with a particular embodiment, the nucleic acid 
sequences encoding the first and second recombinant polypeptide are obtained 

from a plant source. 
Oil-Body-Surf ace-Avoid'mg Linkers 

Polypeptide spacers or linkers of variable length and/or negative charge 
20 can be used herein to separate the first and/or second recombinant polypeptides, 
multimeric-protein-complexes, heteromultimeric-protein-complexes, multimeric- 
fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 
immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, and the first 
and/or second thioredoxin-related proteins from the in-frame oil-body-targeting- 
25 protein, to improve activity of and/or the accessibility of the polypeptide or 

complex. For example, in one embodiment set forth herein, positioned between 
a nucleic acid sequence encoding a sufficient portion of an oil-body-protein and a 
nucleic acid sequence encoding either the first and/or second recombinant 
polypeptides, multimeric-protein-complexes, heteromultimeric-protein-complexes, 
30 multimeric-fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 
immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, and the first 
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and/or second thioredoxin-related proteins; is a linker nucleic acid sequence 
encoding an oil-body-surface-avoiding linker amino acid sequence. 

Oil-body-surface-avoiding linkers are positioned between the oil-body 
targeting sequence and an in-frame recombinant polypeptide of interest, e.g., the 
5 multimeric-protein-complexes provided herein, serve to increase the distance and 
or decrease the interaction between the negatively charged oil body surface and 
the recombinant polypeptide of interest. A negatively charged linker is repelled 
by the negatively charged oil body surface, in turn increasing the distance or 
decreasing the interaction of its attached recombinant polypeptide with the oil 
10 body surface. As a consequence of the increased distance from the oil body 

surface, the recombinant polypeptide will be more accessible, e.g. to its target(s) 
substrate, protein substrate, protein partner, and less affected by the charged oil 
body surface. Exemplary linker sequences for use herein can be either a 
negatively charged linker, or a linker having a molecular weight of at least about 

15 35 kd or more. 

As used herein, a "negatively charged linker" sequence, refers to any 
amino acid segment, or nucleic acid encoding such, that has a pi less than or 
equal to the pi of an oil body. In certain embodiments, the pi of the negatively 
charged linker is about 90%, 80%, 70%, 60%, 50%, 40%, 30%, down to 
20 about 25% or more, below that of the pi of an oil body in the particular plant or 
cell system being used. Exemplary negatively charged linkers can be prepared 
comprising any combination of the negatively charged amino acid residues. For 
example, in one embodiment, a negatively charged linker comprises either a 
poly-glutamate or poly-aspartate sequence, or any combination of both amino 
25 acid residues. The negatively charged linker is typically at least 5, 10, 1 5, 20, 
25, 30, 35, 40, 45, 50, 55, 60, 66, 70, 75, 80, 85, 90, 95, 100 or more amino 
acids in length. The negatively charged linkers are preferably non-proteolytic 
(e.g., non-proteolytic linkers), having no site for efficient proteolysis. When 
linker size rather than charge is used to minimize interaction of the recombinant 
30 polypeptide of interest with the oil body surface, then the linker is non- 
proteolytic and ranges in molecular weight from about 35 kd up to about 100 
kd. The upper size limit is chosen such that the expression of, the activity of, 
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the conformation of, and/or the access to target of, the recombinant polypeptide 
of interest is not significantly affected by the linker. 

In certain embodiments, described herein where a non-proteolytic linker 
amino acid sequence is employed, the gene fusion or protein fusion (multimeric- 
5 fusion-protein) can optionally further comprise a linker nucleic or amino acid 
sequence encoding a sequence that is specifically cleavable by an enzyme or a 
chemical, wherein the linker sequence is positioned between the non-proteolytic 
linker sequence and sequence encoding the desired recombinant protein region, 
e.g., the first and/or second recombinant polypeptides, multimeric-protein- 
10 complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 

heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 
proteins set forth herein. When a cleavable linker sequence is used herein, in a 
particular embodiment, it is further downstream than the non-proteolytic linker 
15 sequence from the oil-body-targeting-protein region of the fusion protein. By 

virtue of cleavable linker, the recombinant fusion polypeptides provided herein, 
such as the multimeric-fusion-proteins and redox fusion polypeptides, can be 
isolated and purified by introducing an enzyme or chemical that cleaves said 
multimeric-fusion-protein and/or redox fusion polypeptide from said oil body, 
20 thereby obtaining and/or isolating the desired protein. It is contemplated herein 
that the use of cleavable linker sequence downstream of the non-proteolytic 
linker/spacer sequence will improve the yield of protein recovery when isolating 
or purifying proteins using the methods provided herein. 

The nucleic acid sequences encoding the first or second recombinant 
25 polypeptide may be altered to improve expression levels for example, by 

optimizing the nucleic acids sequence in accordance with the preferred codon 
usage for the particular cell type which is selected for expression of the first and 
second recombinant polypeptide, or by altering of motifs known to destabilize 
mRNAs (see for example: PCT Patent Application 97/02352). Comparison of 
30 the codon usage of the first and second recombinant polypeptide with codon 
usage of the host will enable the identification of codons that may be changed. 
For example, typically plant evolution has tended towards a preference for CG 
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rich nucleotide sequences while bacterial evolution has resulted in bias towards 
AT rich nucleotide sequences. By modifying the nucleic acid sequences to 
incorporate nucleic acid sequences preferred by the host cell, expression may be 
optimized. Construction of synthetic genes by altering codon usage is described 
5 in for example PCT patent Application 93/07278. The first and second 

recombinant polypeptide can be altered using for example targeted mutagenesis, 
random mutagenesis (Shiraishi et al. (1998) Arch. Biochem. Biophys. 358: 104- 
115; Galkin et al. (1997) Protein Eng. 10: 687-690; Carugo et al. (1997) 
Proteins 28: 10-28; Hurley et al. (1996) Biochemistry 35: 5670-5678), gene 
10 shuffling, and/or by the addition of organic solvent (Holmberg et al. (1999) 
Protein Eng. 12: 851-856). Any polypeptide spacers that are used in 
accordance with the methods and products provided herein may be altered in 
similar ways. 

In particular embodiments provided herein, the recombinant polypeptides 
15 or thioredoxin-related proteins capable of forming a multimeric-protein-complex 
are capable of forming a heteromultimeric-protein-complex. Examples of 
heteromultimeric-protein-complexes that contain polypeptide chains that 
repeatedly interact, either to activate, inactivate, oxidize, reduce, stabilize, etc., 
with one another, that can be produced in association with oil bodies using the 
20 methods provided herein include those set forth in Figure 5. Accordingly, 

exemplary proteins for use in the heteromultimeric-protein-complexes and nucleic 
acid constructs encoding such, provided herein include, among others described 
herein, those set forth in Figure 5. 

Other polypeptide regions that can be used in the first and/or second 
25 recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 

protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins, provided herein include, 
among other, those immunoglobulin regions set forth in Table 1 . 



30 
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TABLE 1 - ANTIBODY HETERODIMERS 

Class or molecule Subunits 

j=a"b Variable region and first constant region ot 

heavy chain and complete light chain 
5 p v Variable regions of heavy and light 

antibody chains 

, gA heavy chains, light chains and J (joining) 

chain 

IgG IgD IgE heavy and light chains 

)gM ' ' heavy chains, light chains and J (joining) 

chain 

Antibody chain(s) and a toxin Antibody chain(s) and a toxin 

10 Autoantigens, allergens and Autoantigens, allergens and transplant 

transplant antigens with an antigens with an adjuvant or tolerogen 
adjuvant or tolerogen 

Chimeras using antibody Fc Receptor subunits fused to the constant 

doma j n region of antibody heavy chains 



15 



20 



As set forth above, in one embodiment, exemplary heteromultimeric- 
protein-complexes and exemplary heteromultimeric-fusion-proteins provided 
herein comprise redox proteins, such as the thioredoxins and thioredoxin- 
reductases and immunoglobulins. 



Oil-body-targeting-proteins 

The nucleic acid sequence encoding the oil-body-targeting-protein that 
may be used in the methods and compositions provided herein may be any 
nucleic acid sequence encoding an oil-body-targeting-protein, protein fragment or 

25 peptide capable of association with first recombinant polypeptide, 
heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and/or a first and/or second thioredoxin- 
related protein and the oil bodies. The nucleic acid sequence encoding the oil 

30 body targeting peptide may be synthesized or obtained from any biological 
source. 

For example, in one embodiment the oil-body-targeting-protein is an 
immunoglobulin or an immunoglobulin derived molecule, for example, a bispecific 
single chain antibody. The generation of single chain antibodies and bi-specific 
35 single chain antibodies is known to the art (see, e.g., US Patents US 5,763,733, 
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US5,767,260 and US5,260,203). Nucleic acid sequences encoding single chain 
antibodies functioning as oil-body-targeting-proteins may be prepared from 
hybridoma cell lines expressing monoclonal antibodies raised against an oleosin 
as described by Alting-Mees et al (2000) IBC's Annual International Conference 
5 on Antibody Engineering, Poster #1 . In order to attain specificity for the first 
recombinant polypeptide a nucleic acid sequence encoding a second single chain 
antibody prepared from a monoclonal raised against the first recombinant 
polypeptide may be prepared and linked to the anti-oleosin single chain antibody. 
In this embodiment the oil body associates with the first recombinant 
10 polypeptide through non-covalent interactions of the oil-body-targeting-protein 
with the first recombinant polypeptide and the oil body. Alternatively the first 
recombinant polypeptide may be prepared as a fusion protein with an oil-body- 
targeting-protein. For example, a nucleic acid sequence encoding a single chain 
antibody raised against an oleosin may be fused to a nucleic acid sequence 

15 encoding the first recombinant polypeptide 

Non-immunoglobulin-based oil-body-targeting-proteins capable of 
association with the first recombinant polypeptide may be discovered and 
prepared using for example phage display techniques (Pharmacia Biotech 
Catalogue Number 27-9401-011 Recombinant Phage Antibody System 

20 Expression Kit). 

Oil-body-targeting-proteins may also be chemically modified. For 
example, oleosins may be modified by changing chemical modification of the 
lysine residues using chemical agents such as biotinyl-N-hydroxysuccinimide 
ester resulting in a process referred to as biotinylatidn. Conveniently this is 

25 accomplished by in vitro biotinylation of the oil bodies. In vivo biotinylation may 
be accomplished using the biotinylation domain peptide from the biotin carboxy 
carrier protein of E. coli acetyl-CoA carboxylase (Smith et al. (1998) Nucl. Acids. 
Res. 26: 1414-1420). Avidin or streptavidin may subsequently be used to 
accomplish association of the redox protein with the oil body. 

30 |n a particular embodiment the oil-body-targeting-protein is an oil-body- 

protein such as for example an oleosin or a caleosin or a sufficient portion 
derived thereof capable of targeting to an oil body. Nucleic acid sequences 
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encoding oleosins are known to the art. These include for example the 
Arabidopsis oleosin (van Rooijen et al (1991) Plant Mol. Bio. 1 8: 1 1 77-1 1 79); the 
maize oleosin (Qu and Huang (1990) J. Biol. Chem. Vol. 265 4:2238-2243); 
rapeseed oleosin (Lee and Huang (1991) Plant Physiol. 96:1395-1397); and the 
5 carrot oleosin (Hatzopoulos et al (1990) Plant Cell Vol. 2, 457-467.). Caleosin 
nucleic acid sequences are also known to the art (Naested et al (2000) Plant Mol 
Biol. 44(4):463-476; Chen et al (1999) Plant Cell Physiol. 40(1 0):1 079-1 086). 
Animal cell derived oil body proteins that may be used herein include adopihilin 
(Brasaemle et al, (1997) J. Lipid Res., 38: 2249-2263; Heid et al. (1998) Cell 
10 Tissue Research 294: 309-321), perilipin (Blanchette-Mackie et al. (1995), J. 
Lipid Res. 36: 1 21 1 -1 226; Servetnick et al. (1 995) J. Biol. Chem. 270: 1 6970- 
16973), apolipoproteins such as apo A-l, A-li, A-IV, C-l, C-ll, Cill (Segrest et al. 
(1990), Proteins 8:103-117) and apoB (Chatterton et al. (1995) J. Lipid Res. 36: 
2027-2037; Davis, RA in: Vance DE, Vance J. editors. Lipoprotein structure and 
15 secretion. The Netherlands, Elsevier, 191: 403-426. 

In one embodiment, the first recombinant polypeptide- is fused to an oil- 
body-protein. The methodology is further described in US patent 5,650,554, 
which is incorporated herein by reference in its entirety. The first recombinant 
polypeptide may be fused to the N-terminus as well as to the C-terminus of the 
20 oil-body-protein (as described in: Moloney and van Rooijen (1996) INFORM 
7:107-1 13) and fragments of the oil-body-protein such as for example the 
central domain of an oleosin molecule, or modified versions of the oil-body- 
protein may be used. In this embodiment, the second recombinant polypeptide 
is expressed intracellular^ and then intracellularlly associates with the first 
25 recombinant polypeptide to form the multimeric-protein-complex in the cell. Oil 
bodies comprising the multimeric-protein-complex are then conveniently isolated 
from the cells. 

In a further embodiment both the first and second recombinant 
polypeptide are separately fused to an oil-body-protein. In this embodiment 
30 nucleic acid sequences encoding the first and second polypeptides may be 
prepared separately and introduced in separate cell lines or they may be 
introduced in the same cell lines. Where the nucleic acid sequences are 
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introduced in the same cell line, these nucleic acid sequence may be prepared 
using two separate expression vectors, or they may be prepared using a single 
vector comprising nucleic acid sequences encoding both the first polypeptide 
fused to an oil body protein and the second polypeptide fused to an oil-body- 
5 protein. Where separate cell lines are used subsequent mating of the offspring 
(e.g.. mating of plants) is used to prepare a generation of cells comprising oil 
bodies which comprise both the first and second recombinant polypeptide fused 
to an oil-body-protein. 

In further alternate embodiment, the first and second recombinant 
10 polypeptide are fused to form a multimeric-fusion-protein comprising the 
multimeric-protein-complex. In such an embodiment, the first and second 
polypeptide is associated with the oil body through an oil-body-targeting-protein 
capable of associating with both the fusion protein and with the oil body. In a 
particular embodiment, the fusion protein comprising the multimeric-protein- 
1 5 complex is fused to an oil-body-protein, for example, an oleosin or caleosin. 

In embodiments provided herein in which the multimeric-protein-complex 
is an immunoglobulin (e.g., a multimeric-immunoglobulin-complex), a particularly 
preferred oil body targeting protein is an oleosin or caleosin associated with an 
immunoglobulin binding protein, such as for example protein A (US Patent 
20 5,1 51 ,350), protein L (US Patent 5,965,390) and protein G (US Patent 

4,954,618), or active fragments of such immunoglobulin binding proteins. 

New oil-body-proteins may be discovered for example by preparing oil 
bodies (described in further detail below) and identifying proteins in these 
preparations using for example SDS gel electrophoresis. Polyclonal antibodies 
25 may be raised against these proteins and used to screen cDNA libraries in order 
to identify nucleic acid sequences encoding oil-body-proteins. The 
methodologies are familiar to the skilled artisan (Huynh et al. (1985) in DNA 
Cloning Vol. 1. a Practical Approach ed. DM Glover, IRL Press, pp 49-78). New 
oil-body-proteins may further be discovered using known nucleic acid sequences 
30 encoding oil-body-proteins (e.g. the Arabidopsis, rapeseed, carrot and corn 

nucleic acid sequences) to probe for example cDNA and genomic libraries for the 
presence of nucleic acid sequences encoding oil-body-proteins. 
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In one embodiment, the first and second polypeptide are a first and 
second redox protein. Accordingly, one embodiment provided herein relates to 
novel and improved methods for the production of redox proteins. It has 
uneX p ec tedly been found that a redox protein when prepared as a fusion protein 
5 with a second redox protein is fully enzymatically active when produced in 
association with an oil body. In contrast, when the redox protein is prepared 
without the second redox protein it has reduced enzymatic activity. In one 
embodiment, the first redox protein is at least 5 times more active when 
produced as a redox fusion polypeptide relative to production as a non-fusion 

10 polypeptide. 

Accordingly, provided herein are methods for producing an oil body 
associated with a heteromultimeric redox protein complex, said method 
comprising: 

(a) producing in a cell comprising oil bodies, a first redox protein and a 
15 second redox protein wherein said first redox protein is capable of interacting 

with said second redox protein, preferably in the cell, to form said 
heteromultimeric redox protein complex; and 

(b) associating said heteromultimeric redox protein complex with an oil 
body through an oil-body-targeting-protein capable of associating with said oil 

20 bodies and said heteromultimeric redox protein complex. 

In a particular embodiment the first and second redox protein are 
prepared as a fusion protein to form a redox fusion polypeptide. Accordingly, 
provided herein are methods for preparing an enzymatically active redox protein 
associated with oil bodies comprising: 

26 a) producing in a cell a redox fusion polypeptide comprising a first redox 

protein linked to a second redox protein; 

b) associating said redox fusion polypeptide with oil bodies through an 
oil-body-targeting-protein capable of associating with said redox fusion 
polypeptide and said oil bodies; and 
30 c) isolating said oil bodies associated with said redox fusion polypeptide. 

The oil bodies in association with the redox protein may be used to prepare a 
variety of useful emulsions. 
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As used herein the phrase "redox proteins" or grammatical variations 
thereof, refers to any protein or active protein fragment capable of participating 
in electron transport. For example, redox proteins are capable of catalyzing the 
transfer of an electron from an electron donor (also frequently referred to as the 
5 reducing agent) to an electron acceptor (also frequently referred to as the 

oxidizing agent). In the process of electron transfer, the reducing agent (electron 
donor) is oxidized and the oxidizing agent (electron acceptor) is reduced. 
Exemplary redox proteins for use herein include iron-sulfur proteins, 
cytochromes, redox active thiol proteins and redox-active flavoproteins. To 
10 carry out their function as conduits for electrons, redox proteins, such as 

thioredoxin and thioredoxin-reductase for example, are known to function by 
interacting or associating with one another in multimeric-protein-complexes (e.g., 
heteromultimeric-protein-complexes). 

The term "redox fusion polypeptide" as used herein refers to any fusion 
15 polypeptide comprising a first redox protein linked to a second redox protein 
(e.g., an in-frame translational fusion). The redox proteins that may be used 
with the methods and compositions provided herein may be any redox protein. 
In one embodiment the first and second redox proteins are a pair of redox 
proteins that would normally occur together from the same source, in nature. In 
20 a particular embodiment, the first redox protein is a thioredoxin and the second 
redox protein is a thioredoxin-reductase. 

The redox fusion polypeptide may be produced in any cell comprising oil 
bodies, including any animal cell, plant cell, algae cell, fungal cell or bacterial 
cell. In certain embodiments the redox fusion polypeptide is produced in a plant 
25 cell and in particular embodiments the redox fusion polypeptide is produced in 

the seed cells of a seed plant. 

In particular embodiments the oil-body-targeting-protein that is used is an 
oil-body-protein. In embodiments of the present invention in which an oil-body- 
protein is used, the first and second redox protein are preferably covalently fused 
30 to the oil-body-protein. Accordingly, provided herein are methods for the 
preparation of a redox protein in association with an oil body comprising: 

a) introducing into a cell a chimeric nucleic acid sequence comprising: 



RECTIFIED SHEET (RULE 91) 



WO 02/50289 



PCT7US01/50240 



-35- 

1) a first nucleic acid sequence capable of regulating transcription in 
said cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant fusion 
polypeptide comprising (i) a first nucleic acid sequence encoding a 

5 sufficient portion of an oil-body-protein to provide targeting of said 

recombinant fusion polypeptide to an oil body linked in reading 
frame to (ii) a second nucleic acid sequence encoding a redox 
fusion polypeptide comprising a first redox protein linked to a 
second redox protein operatively linked to; 
10 3} a third nucleic acid sequence capable of terminating transcription 

in said cell; 

b) growing said cell under conditions to permit expression of said 
redox fusion polypeptide in a progeny cell comprising oil bodies; 
and 

15 C ) isolating said oil bodies comprising said redox fusion polypeptide 

from said progeny cell. 

Redox Proteins 

In accordance with various methods and compositions provided herein, 
20 any nucleic acid sequence encoding a redox protein may be used. The nucleic 
acid sequence encoding the first and/or second redox protein may be obtained 
from any biological source or may be prepared synthetically. In general, nucleic 
acid sequences encoding redox proteins are well known in the art and readily 
available. See, for example: Cristiano et al. (1993) Genomics 17: (2) 348-354, 
25 Doyama et aL (1998) Plant Sci. 137: 53-62, Hoeoeg et al. (1984) Biosci. Rep. 4: 
917-923; as well as the Swiss Protein sequences set forth in Table 5. Known 
nucleic acid sequences encoding redox proteins may be used to design and 
construct nucleic acid sequence based probes in order to uncover and identify 
previously undiscovered nucleic acid sequences encoding redox proteins, for 
30 example by screening cDNA or genomic libraries. Thus, additional nucleic acid 
sequences may be discovered and used in accordance with the present 
invention. 
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The nucleic acid sequence encoding the first and/or second redox protein 
may be obtained from separate sources or may be obtained from the same 
source. In general however, the nucleic acid sequence encoding a redox-fusion 
polypeptide comprises nucleic acid sequences encoding a first and a second 
5 redox protein obtained from the same or a similar biological source. In certain 
embodiments provided herein, wherein the nucleic acid sequence encoding the 
first and second redox protein is obtained from the same source, the nucleic acid 
sequence encoding the first redox protein and second redox protein may be 
naturally fused. In accordance with a particular embodiment, the nucleic acid 
10 sequences encoding the first and second redox protein are preferably obtained 

from a plant source. 

As set forth above, a polypeptide spacer or linker of variable length may 
separate the first and second redox proteins from each other and/or from the oil- 
body-targeting-protein; and additional redox proteins (e.g., one or more) may be 
15 fused to the first and/or second redox protein. 

The nucleic acid sequences encoding the redox proteins may be altered to 
improve expression levels for example by optimizing the nucleic acids sequence 
in accordance with the preferred codon usage for the particular cell type which is 
. selected for expression of the redox proteins, or by altering of motifs known to 
20 destabilize mRNAs (see for example: PCT Patent Application 97/02352). 

Comparison of the codon usage of the redox protein with codon usage of the 
host will enable the identification of codons that may be changed. For example, 
typically plant evolution has tended towards a preference for CG rich nucleotide 
sequences while bacterial evolution has resulted in bias towards AT rich 
25 nucleotide sequences. By modifying the nucleic acid sequences to incorporate 
nucleic acid sequences preferred by the host cell, expression may be optimized. 
Construction of synthetic genes by altering codon usage is described in for 
example PCT patent Application 93/07278. The redox proteins may be altered 
using for example, targeted mutagenesis, random mutagenesis (Shiraishi et al. 
30 (1998) Arch. Biochem. Biophys. 358: 104-115; Galkin et al. (1997) Protein Eng. 
10: 687-690; Carugo et al. (1997) Proteins 28: 10-28; Hurley et al. (1996) 
Biochemistry 35: 5670-5678) (and/or by the addition of organic solvent 
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(Holmberg et al. (1999) Protein Eng. 12: 851-856). The polypeptide spacer 
between the first and second redox protein may be altered in similar ways. 

The first and second redox protein may be selected by developing a two- 
dimensional matrix and determining which combination of first and second redox 
5 protein is most effective in electron transport using for example, a colorometric 
reduction assay (Johnson et al (1984) J. of Bact. Vol. 158 3:1061-1069, 
Luthman et al (1982) Biochemistry Vol 21 26:6628-2233). Combinations of 
thioredoxin and thioredoxin-reductase may be tested by determining the 
reduction of wheat storage proteins and milk storage protein beta-lactoglobulin in 
10 vitro (Del Val et al. (1999) J. Ailerg. Clin. Immunol. 103: 690-697). Using the 
same strategy polypeptide spacers between the first and second redox proteins 
may be evaluated for their efficiency. 

First and second redox proteins that may be used herein include without 
limitation any first redox protein and second redox protein selected from the 
15 group of redox proteins consisting of cytochromes, such as cytochrome a, 
cytochrome b and cytochrome c; porphyrin containing proteins, for example 
hemoglobin; iron-sulfur proteins, such as ferredoxin; flavoproteins such as 
thioredoxin-reductase, ISIADH dehydrogenase, succinate dehydrogenase, 
dihydrolipoyl dehydrogenase, acyl-CoA dehydrogenase, D-amino acid oxidase, 
20 xanthine oxidase, orotate reductase and aldehyde oxidase; pyridine-linked 
dehydrogenases, for example, lactate dehydrogenase, glyceraldehyde-3- 
phosphate dehydrogenase, maiate dehydrogenase, and beta-hydroxy-butarate 
dehydrogenase; and redox active thiol containing proteins such as thioredoxin. 
In particular embodiments, the redox proteins provided herein are 
25 thioredoxin and its reductant thioredoxin-reductase (which are jointly also 

referred to herein as "thioredoxin-related" protein(s)). As used herein, the term 
"thioredoxin" refers to relatively small proteins (typically approximately 1 2 kDa) 
that belong to the family of thioltransferases which catalyze oxido-reductions via 
the formation or hydrolysis of disulfide bonds and are widely, if not universally, 
30 distributed throughout the animal plant and bacterial kingdom. The reduces form 
of thioredoxin is an excellent catalyst for the reduction of even the most 
intractable disulfide bonds, in order to reduce the oxidized thioredoxin, two 
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cellular reductants provide the reduction equivalents: reduced ferredoxin and 
NADPH. These reduction equivalents are supplied to thioredoxin via interaction 
or association with different thioredoxin-reductases including the NADPH 
thioredoxin-reductase and ferredoxin thioredoxin-reductase. The supply of these 
5 reduction equivalents requires the formation of a heteromultimeric-protein- 
. complex comprising thioredoxin and thioredoxin-reductase. Ferredoxin 
thioredoxin-reductase is involved in the reduction of plant thioredoxins 
designated as Trxf and Trxm, both of which are involved in the regulation of 
photosynthetic processes in the chloroplast. The NADPH/thioredoxin active in 
10 plant seeds is designated Trxh (also referred to herein as thioredoxin h-type) and 
is capable of the reduction of a wide range of proteins thereby functioning as an 
important cellular redox buffer. Generally, only one kind of thioredoxin, which 
analogous to the plant Trxh type, is found in bacterial or animal cells. The h- 
type thioredoxins are capable of being reduced by NADPH and NADPH- 

15 thioredoxin reductase. 

Exemplary thioredoxins are further characterized as a protein having a 
core of 5 beta-sheets surrounded by 4 to 6 alpha helixes. Exemplary 
thioredoxins are further characterized by having an active site containing the 
consensus amino acid sequence: 

20 XCYYCZ, 

wherein Y is any amino acid, such as hydrophobic or non-polar amino acids, 
wherein X can be any of the 20 amino acids, preferably a hydrophobic amino 
acid, such as a tryptophan, and 

Z can be any amino acid, preferably polar amino acids. 
25 In certain embodiments, the thioredoxins for use herein comprise an active site 
having the amino acid sequence XCG PCZ. 

When the cysteines in the active site of thioredoxin or thioredoxin-like proteins 
are oxidized, they form an intramolecular disulfide bond. In the reduced state, 
the same active sites are capable of participating in redox reactions through the 
30 reversible oxidation of its active site dithiol, to a disulfide and catalyzes 
dithioldisulfide exchange reactions. 
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Exemplary thioredoxins are well-known in the art and can be obtained 
from several organisms including Arabidopsis thaliana (Riveira Madrid et al. 
(1995) Proc. Natl. Acad. Sci. 92: 5620-5624), wheat (Gautier et al. (1998) Eur. 
J. Biochem. 252: 314-324); Escherichia coli (Hoeoeg et al (1984) Biosci. Rep. 4: 
5 917-923) and thermophylic microorganisms such as Methanococcus jannaschii 
and Archaeoglobus fulgidus (PCT Patent Application 00/36126). Thioredoxins 
have also been recombinantly expressed in several host systems including 
bacteria (Gautier et al. (1998) Eur J. Biochem. 252: 314-324) and plants (PCT 
Patent Application WO 00/58453) Commercial preparations of E. coli sourced 
10 Thioredoxins are readily available from for example: Sigma Cat No. T 0910 
Thioredoxin [E. coli, recombinant; expressed in E. coli). 

Exemplary nucleic acid sequences encoding thioredoxin polypeptides for 
use herein are readily available from a variety of diverse biological sources 
including E. coli (Hoeoeg et al. (1984) Biosci. Rep.: 4 917-923); Methanococcus 
1 5 jannaschii and Archaeoglobus fulgidus (PCT Patent Application 00/36 1 26); 
Arabidopsis thaliana (Rivera-Madrid (1995) Proc. Natl. Acad. Sci. 92: 5620- 
5624); wheat (Gautier et al (1998) Eur. J. Biochem. 252(2): 314-324); tobacco 
(Marty et al. (1991) Plant Mol. Biol. 17: 143-148); barley (PCT Patent 
Application 00/58352); rice (Ishiwatari et al. (1995) Planta 195: 456-463); 
20 soybean (Shi et al. (1 996) Plant Mol. Biol. 32: 653-662); rapeseed (Bower et al. 
Plant Cell 8: 1641-1650) and calf (Terashima et al. (1999) DNA Seq. 10(3): 
203-205); and the like. In yet other embodiments, exemplary nucleic acids for 
use herein include those encoding the thioredoxin and thioredoxin-like 
polypeptide chains set forth as SEQ ID NOs:38, 42, 46 and 50; and those 
25 encoding the thioredoxin and thioredoxin-like polypeptide chains set forth in 
Table 5 as SEQ ID NOs:52-194. The respective nucleic acid sequences 
encoding the amino acids set forth in SEQ ID NOs:52-1 94 can be readily 
identified via the Swiss Protein identifier (accession) numbers provided in Table 5 
(in parenthesis). 

30 As used herein, the term "thioredoxin-reductase" refers to a protein that 

complexes with a flavin, such as FAD. The flavin compound serves as an 
electron donor for the thioredoxin-reductase protein active site. Thioredoxin 
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reductases have a redox active, disulfide bond site capable of reducing 
thioredoxin. The active site of thioredoxin-reductase contains 2 cysteines. The 
type of amino acids surrounding the 2 cysteine residues forming the active site 
can vary as hydrophobic, non-polar or polar. An exemplary thioredoxin- 
5 reductase is NADPH-thioredoxin-reductase (NTR), which is a cytosolic 
homodimeric enzyme comprising typically 300-500 amino acids. Crystal 
structures of both E. coli and plant thioredoxin-reductase have been obtained 
(Waksman et al. (1994) J. Mol. Biol. 236: 800-816; Dai et al. (1996) J. MoJ. 
Biol. 264:1044-1057). NADPH-thioredoxin-reductases have been expressed in 
10 heterologous hosts, for example the Arabidopsis NADPH-thioredoxin-reductase 
has been expressed in E. coli (Jacquot et al. (1994) J. Mol. Biol. 235: 1357- 
1363) and wheat (PCT Patent Application 00/58453). 

Exemplary nucleic acid sequences encoding thioredoxin-reductase 
proteins can readily be obtained from a variety of sources, such as from the 
15 sequence set forth in Table 5 and the Sequence Listing provide herein, from 
Arabidopsis (Riveira Madrid et al. (1995) Proc. Natl. Acad. Sci. USA 92: 5620- 
5624), E. coli (Russel et al. (1988) J. Biol. Chem. 263: 9015-9019); barley (PCT 
Patent Application 00/58352 and wheat (Gautier et al., (1998) Eur. J. Biochem. 
252: 314-324); and the like. In yet other embodiments, exemplary nucleic acids 
20 for use herein include those encoding the thioredoxin-reductase polypeptide 
chains set forth as SEQ ID NOs:8, 9, 10, 40, 44, 48 and 50; and those 
encoding the thioredoxin-reductase polypeptide chains set forth in Table 5 as 
SEQ ID NOs: 195-3 13. The respective nucleic acid sequences encoding the 
amino acids set forth in SEQ ID NOs: 1 95-31 3 can be readily identified via the 
25 Swiss Protein identifier (accession) numbers provided in Table 5 (in parenthesis). 

Also contemplated for use in the methods and compositions provided 
herein are nucleic acid and amino acid homologs that are "substantially 
homologous" to the thioredoxin and thioredoxin-reductase nucleic and amino 
acids set forth herein, which includes thioredoxin and thioredoxin-reductase 
30 polypeptides encoded by a sequence of nucleotides that hybridizes under 

conditions of low, moderate or high stringency to the sequence of nucleotides 
encoding the thioredoxin and thioredoxin-reductase nucleic and amino acids set 
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forth herein {e.g., in the Examples, Sequence Listing and/or Table 5). As used 
herein, a DNA or nucleic acid homolog refers to a nucleic acid that includes a 
preselected conserved nucleotide sequence, such as a sequence encoding a 
therapeutic polypeptide. By the term "substantially homologous" is meant 
5 having at least 80%, preferably at least 90%, most preferably at least 95% 

homology therewith or a less percentage of homology or identity and conserved 

. biological activity or function. 

The terms "homology" and "identity" are often used interchangeably. In 
this regard, percent homology or identity may be determined, for example, by 
10 comparing sequence information using a GAP computer program. The GAP 

program utilizes the alignment method of Needleman and Wunsch (J. MoL Biol. 
48:443 (1970), as revised by Smith and Waterman (Adv. Appl. Math. 2:482 
(1981). Briefly, the GAP program defines similarity as the number of aligned 
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 
15 number of symbols in the shorter of the two sequences. The preferred default 
parameters for the GAP program may include: (1) a unary comparison matrix 
(containing a value of 1 for identities and 0 for non-identities) and the weighted 
comparison matrix of Gribskov and Burgess, NucL Acids Res. 14:6745 (1986), 
as described by Schwartz and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE 
20 AND STRUCTURE, National Biomedical Research Foundation, pp. 353-358 

(1979); (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each 
symbol in each gap; and (3) no penalty for end gaps. 

By sequence identity, the number of conserved amino acids are 
determined by standard alignment algorithms programs, and are used with 
25 default gap penalties established by each supplier. Substantially homologous 
nucleic acid molecules would hybridize typically at moderate stringency or at 
high stringency all along the length of the nucleic acid of interest. Preferably the 
two molecules will hybridize under conditions of high stringency. Also 
contemplated are nucleic acid molecules that contain degenerate codons in place 
30 of codons in the hybridizing nucleic acid molecule. 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% "identical" can be 
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determined using known computer algorithms such as the "FAST A" program, 
using for example, the default parameters as in Pearson and Lipman, Proc. Natl. 
Acad. Sci. USA 85:2444 (1988). Alternatively the BLAST function of the 
National Center for Biotechnology Information database may be used to 
5 determine relative sequence identity. 

In general, sequences are aligned so that the highest order match is 
obtained. "Identity" per se has an art-recognized meaning and can be calculated 
using published techniques. (See, e.g.: Computational Molecular Biology. Lesk, 
A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics 
10 and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Parti, Griffin, A.M., and Griffin, H.G., 
eds., Humana Press, New Jersey, 1 994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, 
Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991). 
15 While there exist a number of methods to measure identity between two 

polynucleotide or polypeptide sequences, the term "identity" is well known to 
skilled artisans (Carillo, H. & Upton, D., SIAM J Applied Math 45:1073 (1988)). 
Methods commonly employed to determine identity or similarity between two 
sequences include, but are not limited to, those disclosed in Guide to Huge 
20 Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and 

Carillo, H. & Upton, D., SIAM J Applied Math 48:1073 (1988). Methods to 
determine identity and similarity are codified in computer programs. Preferred 
computer program methods to determine identity and similarity between two 
sequences include, but are not limited to, GCG program package (Devereux, J., 
25 et al.. Nucleic Acids Research 12(0:387 (1 984)), BLASTP, BLASTN, FASTA 
(Atschul, S.F., eta/., J Molec Biol 216:403 (1990)). 

Therefore, as used herein, the term "identity" represents a comparison 
between a test and a reference polypeptide or polynucleotide. For example, a 
test polypeptide may be defined as any polypeptide that- is 90% or more 
30 identical to a reference polypeptide. 

As used herein, the term at least "90% identical to" refers to percent 
identities from 90 to 99.99 relative to the reference polypeptides. Identity at a 
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level of 90% or more is indicative of the fact that, assuming for exemplification 
purposes a test and reference polynucleotide length of 100 amino acids are 
compared. No more than 10% (i.e., 10 out of 100) amino acids in the test 
polypeptide differs from that of the reference polypeptides. Similar comparisons 
5 may be made between a test and reference polynucleotides. Such differences- 
may be represented as point mutations randomly distributed over the entire 
length of an amino acid sequence or they may be clustered in one or more 
locations of varying length up to the maximum allowable, e.g. 10/100 amino 
acid difference (approximately 90% identity). Differences are defined as nucleic 
10 acid or amino acid substitutions, or deletions. 

As used herein: stringency of hybridization in determining percentage 

mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65°C 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 
15 3) low stringency: 1.0 x SSPE, 0.1% SDS, 50°C 

Those of skill in this art know that the washing step selects for stable 
hybrids and also know the ingredients of SSPE (see, e^, Sambrook, E.F. 
Fritsch, T. Maniatis, in: Molecular Clo ninn A Laboratory Manual, Cold Spring 
Harbor Laboratory Press (1989), vol. 3, p. B.13, see, also, numerous catalogs 

20 that describe commonly used laboratory solutions). SSPE is pH 7.4 phosphate- 
buffered 0.1 8 NaCI. Further, those of skill in the art recognize that the stability 
of hybrids is determined by T m , which is a function of the sodium ion 
concentration and temperature (T m = 81.5° C-1 6.6<log 10 [Na*]) + 0.41 (%Q + C)- 
600/D), so that the only parameters in the wash conditions critical to hybrid 

25 stability are sodium ion concentration in the SSPE (or SSC) and temperature. 

It is understood that equivalent stringencies may be achieved using 
alternative buffers, salts and temperatures. By way of example and not 
limitation, procedures using conditions of low stringency are as follows (see also 
Shilo and Weinberg, Proa. Natl. Acad. Sci. USA. 78:6789-6792 (1981)): Filters 

30 containing DNA are pretreated for 6 hours at 40°C in a solution containing 35% 
formamide, 5X SSC, 50 mM Tris-HCI (pH 7.5), 5 mM EDTA, 0.1 % PVP, 0.1 % 
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Ficoll, 1% BSA, and 500 //g/ml denatured salmon sperm DNA (10X SSC is 1 .5 
M sodium chloride, and 0.15 M sodium citrate, adjusted to a pH of 7). 

In a particular embodiment, a heteromultimeric-protein-compiex is 
produced as a fusion polypeptide between the first and second redox protein, 
B wherein the first redox protein is thioredoxin and the second redox protein is a 
thioredoxin-reductase. In one embodiment, the second recombinant polypeptide, 
e.g., the thioredoxin-reductase is positioned N-terminal relative to the first 
recombinant polypeptide, e.g., the thioredoxin. Accordingly, any protein which 
is classified as thioredoxin, such as the thioredoxin component of the NADPH 
10 thioredoxin system and the thioredoxin present in the ferredoxin/thioredoxin 
system also known as TRx and TRm may be used in combination with any 
thioredoxin-reductase such as the NADPH thioredoxin-reductase and the 
ferredoxin-thioredoxin-reductase and any other proteins having the capability of 
reducing thioredoxin. In particular embodiments the thioredoxin and thioredoxin- 

15 reductase are plant derived. 

In an alternate embodiment, the naturally occurring nucleic acid sequence 
encoding the thioredoxin/thioredoxin-reductase protein fusion obtainable from 
Mycobacterium leprae (Wieles et al. (1995) J. Biol. Chem. 27:25604-25606) is 
used, as set forth in the Examples herein. 
20 Immunoglobulins 

In another embodiment of the present invention, the multimeric- protein- 
complexes are immunoglobulins. As used herein "immunoglobulin-polypeptide- 
chain" refers to a first polypeptide that is capable of associating with a second^ 
polypeptide to form an immunologically active (i.e. capable of antigen binding) 
25 multimeric-protein-complex. The types of immunoglobulins and immunoglobulin- 
polypeptide-chains contemplated for use herein include the immunologically 
active (i.e. antigen binding) portions of a light and heavy chain. Exemplary 
immunoglobulins and immunoglobulin-polypeptide-chains for use herein include 
substantially intact immunoglobulins, including any IgG, IgA, IgD, IgE and IgM, 
30 as well as any portion of an immunoglobulin, including those portions well- 
known as Fab fragments, Fab' fragments, F(ab').sub2. fragments and Fv 
fragments. 
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In this embodiment, the first recombinant polypeptide may be any 
immunoglobulin heavy chain, including any IgG, IgA, IgD, IgE or IgM heavy 
chain, and the second recombinant polypeptide may be a kappa or lambda 
immunoglobulin light chain. Accordingly, provided herein are methods of 
5 producing an immunoglobulin, said method comprising: (a) producing in a cell 
comprising oil bodies, a first immunoglobulin-polypeptide-chain and a second 
immunoglobulin-polypeptide-chain wherein said first immunoglobulin-polypeptide- 
chain is capable of associating with said second immunoglobulin-polypeptide- 
chain to form said immunoglobulin; and (b) associating said immunoglobulin with 
10 an oil body through an oil-body-targeting-protein capable of associating with said 
oil bodies and said first immunoglobulin-polypeptide-chain. 

As set forth herein, the multimeric immunoglobulin is associated with an 
oil body through an oil-body-targeting-protein. In particular embodiments, the 
oil-body-targeting-protein may be a fusion polypeptide comprising an 
15 oil-body-protein and an immunoglobulin binding protein, such as for example 
protein A, protein L, and protein G. 

In yet another embodiment involving immunoglobulins, the first and 
second recombinant polypeptides (immunoglobulins) are separately fused to an 
oil body protein, for example an oleosin or caleosin. For example, 
20 a) the first recombinant polypeptide may be an immunoglobulin heavy 

chain, including any IgG, IgA, IgD, IgE or IgM heavy chain, and the second 
recombinant polypeptide may be a kappa or lambda immunoglobulin light chain; 



or 



b) the first recombinant polypeptide may be the variable and first 
25 constant domain from an immunoglobulin heavy chain and the second 

recombinant polypeptide may be a kappa or lambda immunoglobulin light chain; 



or 



c) the first recombinant polypeptide may be the variable domain from 
an immunoglobulin heavy chain and the second recombinant polypeptide may be 
30 the variable domain from a kappa or lambda immunoglobulin light chain. 

In certain embodiments, the fusion polypeptides are designed or selected 
to allow the heteromultimeric-protein-complex formation between 
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immunoglobulin light and heavy chain sequences on the oil bodies within the cell 
comprising oil bodies. 

Preparation of expression vectors comprising oil-body-targeting-proteins and the 
first and/or second recombinant polypeptides, multimeric-protein-complexes, 
5 heteromultfmerlc-proteln-complexes, multimeric-fusion-protelns, 

heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 
proteins. 

In accordance with the present invention, the first and/or second 

10 recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 

protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins; and the oil-body-targeting- 
protein are conveniently produced in a cell. In order to produce the recombinant 

15 polypeptides or multimeric-protein-complexes, a nucleic acid sequence encoding 
either the first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 

20 proteins; and/or the oil-body-targeting-protein are incorporated in a recombinant 
expression vector. Accordingly, provided herein are recombinant expression 
vectors comprising the chimeric nucleic acids provided herein suitable for 
expression of the oil-body-targeting-protein and the first and/or second 
recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 

26 protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins, suitable for the selected 
cell. The term "suitable for expression in the selected cell" means that the 
recombinant expression vector contains all nucleic acid sequences required to 

30 ensure expression in the selected cell. 

Accordingly, the recombinant expression vectors further contain 
regulatory nucleic acid sequences selected on the basis of the cell which is used 
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for expression and ensuring initiation and termination of transcription operatively 
linked to the nucleic acid sequence encoding the recombinant polypeptide or 
multimeric-protein-complex and/or the oil-body-targeting-protein. Regulatory 
nucleic acid sequences include promoters, enhancers, silencing elements, 
5 ribosome binding sites, Shine-Dalgarno sequences, introns and other expression 
elements. "Operatively linked" is intended to mean that the nucleic acid 
sequences comprising the regulatory regions linked to the nucleic acid sequences 
encoding the recombinant polypeptide or multimeric-protein-complex and/or the 
oil-body-targeting-protein allow expression in the cell. A typical nucleic acid 
10 construct comprises in the 5' to 3' direction a promoter region capable of 
directing expression, a coding region comprising the first and/or second 
recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 
protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
15 or the first and/or second thioredoxin-related proteins; and/or an oil-body- 
targeting-protein and a termination region functional in the selected cell. 

The selection of regulatory sequences will depend on the organism and 
the cell type in which the f irst and/or second recombinant polypeptides, 
multimeric-protein-complexes, heteromultimeric-protein-complexes, multimeric- 
20 fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 

immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, or the first and/or 
second thioredoxin-related proteins; and/or the oil-body-targeting-protein is 
expressed, and may influence the expression levels of the polypeptide. 
Regulatory sequences are art-recognized and selected to direct expression of the 
25 oil-body-targeting-protein and the recombinant polypeptides or multimeric- 
protein-complexes in the cell. 

Promoters that may be used in bacterial cells include the lac promoter 
(Blackman et al. H 978) Cell: 13: 65-71), the trp promoter (Masuda et al. (1996) 
Protein Eng: 9: 101-106) and the T7 promoters (Studier et al. (1986) J. Mol. 
30 Biol. 1 89: 11 3-1 30). Promoters functional in plant cells that may be used herein 
include constitutive promoters such as the 35S CaMV promoter (Rothstein et al. 
(1987) Gene: 53: 153-161) the actin promoter (McElroy et al. (1990) Plant Cell 
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2: 163-171) and the ubiquitin promoter (European Patent Application 0 342 
926). Other promoters are specific to certain tissues or organs (for example, 
roots, leaves, flowers or seeds) or cell types (for example, leaf epidermal cells, 
mesophyll cells or root cortex cells) and or to certain stages of plant 
5 development. Timing of expression may be controlled by selecting an inducible 
promoter, for example the PR-a promoter described in US Patent 5,614,395. 
Selection of the promoter therefore depends on the desired location and timing 
of the accumulation of the desired polypeptide. In a particular embodiment, the 
first and/or second recombinant polypeptides, multimeric-protein-complexes, 
10 heteromultimeric-protein-complexes, multimeric-fusion-proteins, 

heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 
proteins; and the oil-body-targeting-protein are expressed in a seed cell and seed 
. specific promoters are utilized. Seed specific promoters that may be used herein 
15 include for example the phaseolin promoter (Sengupta-Gopalan et al. (1985) 
Proc. Natl. Acad. Sci. USA: 82: 3320-3324), and the Arabidopsis 18 kDa 
oleosin promoter (van Rooijen et al. (1992) Plant. Mol. Biol. 18: 1177-1179). 
New promoters useful in various plant cell types are constantly discovered. 
Numerous examples of plant promoters may be found in Ohamuro et al. 

20 (Biochem of PI. (1989) 15: 1-82). 

Genetic elements capable of enhancing expression of the polypeptide may 
be included in the expression vectors. In plant cells these include for example, 
the untranslated leader sequences from viruses such as the AMV leader 
sequence (Jobling and Gehrke (1987) Nature: 325: 622-625) and the intron 

25 associated with the maize ubiquitin promoter (See: US Patent 5,504,200). 

Transcriptional terminators are generally art recognized and besides 
serving as a signal for transcription termination serve as a protective element 
serving to extend the mRNA half-life (Guarneros et al. (1982) Proc. Natl. Acad. 
Sci. USA: 79: 238-242). In nucleic acid sequences for the expression in plant 

30 cells, the transcriptional terminator typically is from about 200 nucleotide to 
about 1000 nucleotides in length. Terminator sequences that may be used 
herein include for example, the nopaline synthase termination region (Bevan et 
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al. (1983) Nucl. Acid. Res.: 11: 369-385), the phaseolin terminator (van der 
Geest et al. (1 994) Plant J.: 6: 413-423), the terminator for the octopine 
synthase gene of Agrobacterium tumefaciens or other similarly functioning 
elements. Transcriptional terminators can be obtained as described by An 
5 (1 987) Methods in Enzym. 1 53: 292). The selection of the transcriptional 
terminator may have an effect on the rate of transcription. 

Accordingly, provided herein are chimeric nucleic acid sequences 
encoding a first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
10 heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and/or thioredoxin-related proteins. In one 
embodiment, said nucleic acid comprises: 

(a) a first nucleic acid sequence encoding an oil-body-targeting-protein 

operatively linked in reading frame to; 
15 (b) a second nucleic acid sequence encoding a first recombinant 

polypeptide, immunoglobulin-polypeptide-chain, or redox protein; linked in 

reading frame to; 

(c) a third nucleic acid sequence encoding a second recombinant 
polypeptide, immunoglobulin-polypeptide-chain or redox protein, wherein said 
20 first and second recombinant polypeptides, immunoglobulin-polypeptide-chains 
or redox proteins are capable of forming a multimeric-protein-complex. 

In another embodiment, provided herein is an expression vector 
comprising: 

1) a first nucleic acid sequence capable of regulating transcription in said 

25 cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant fusion 
polypeptide comprising (i) a nucleic acid sequence encoding a sufficient portion 
of an oil-body-protein to provide targeting of said recombinant fusion polypeptide 
to an oil body linked in reading frame to (ii) a nucleic acid sequence encoding a 

30 multimeric-fusion-protein, such as a redox fusion polypeptide or immunoglobulin, 
comprising a first recombinant polypeptide, such as a redox protein or 
immunoglobulin-polypeptide-chain, linked to a second recombinant polypeptide. 
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such as a second redox protein or a second immunoglobulin-polypeptide-chain, 

operatively linked to; 

3) a third nucleic acid sequence capable of terminating transcription in 

said celt. 

5 The recombinant expression vector further may contain a marker gene. 

Marker genes that may be used in accordance with the present invention include 
all genes that allow the distinction of transformed cells from non-transformed 
cells including all selectable and screenable marker genes. A marker may be a 
resistance marker such as an antibiotic resistance marker against for example 
10 kanamycin, ampicillin, G418, bleomycin hygromycin, chloramphenicol which 
allows selection of a trait by chemical means or a tolerance marker against for 
example a chemical agent such as the normally phytotoxic sugar mannose 
{Negrotto et al. (2000) Plant Cell Rep. 19: 798-803). In plant recombinant 
expression vectors herbicide resistance markers may conveniently be used for 
15 example markers conferring resistance against glyphosate (US Patents 

4,940,935 and 5,188,642) or phosphinothricin (White et al. (1990) Nucl. Acids 
Res. 18: 1062; Spencer et al. (1990) Theor. Appl. Genet. 79: 625-631). 
Resistance markers to a herbicide when linked in close proximity to the redox 
protein or oil-body-targeting-protein may be used to maintain selection pressure 
20 on a population of plant cells or plants for those plants that have not lost the 
protein of interest. Screenable markers that may be employed to identify 
transformants through visual observation include beta-glucuronidase (GUS) (see 
US Patents US5,268,463 and US5,599,670) and green fluorescent protein (GFP) 
(Niedz et al. (1995) Plant Cell Rep.: 14: 403). 
25 The recombinant expression vectors further may contain nucleic acid 

sequences encoding targeting signals ensuring targeting to a cell compartment or 
organelle. Suitable targeting signals that may be used herein include those that 
are capable of targeting polypeptides to the endomembrane system. Exemplary 
targeting signals that may be used herein include targeting signals capable of 
30 directing the protein to the periplasm, the cytoplasm, the golgi apparatus, the 
apoplast (Sijmons et al., 1990, Bio/Technology, 8:217-221) the chloroplast 
(Comai et al. (1988) J. Biol. Chem. 263: 1 5 1 04-1 5 1 09), the mitochondrion, the 
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peroxisome (Unger et al. (1989) Plant Mol. Biol. 13: 411-418), the ER, the 
vacuole (Shinshi et al. (1990) Plant Mol. Biol. 14: 357-368 and the oil body. By 
the inclusion of the appropriate targeting sequences it is possible to direct the 
oil-body-targeting-protein or the first and/or second recombinant polypeptides, 
5 multimeric-protein-complexes, heteromultimeric-protein-complexes, multimeric- 
fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 
immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, and/or 
thioredoxin-related proteins, to the desired organelle or cell compartment. 

The recombinant expression vectors of the present invention may be 
10 prepared in accordance with methodologies well known to those of skill in the 
art of molecular biology (see for example: Sambrook et al. (1990) Molecular 
Cloning, 2 nd ed. Cold Spring Harbor Press). The preparation of these constructs 
may involve techniques such as restriction digestion, ligation, gel 
electrophoresis, DNA sequencing and PCR. A wide variety of cloning vectors is 
15 available to perform the necessary cloning steps resulting in a recombinant 

expression vector ensuring expression of the polypeptide. Especially suitable for 
this purpose are vectors with a replication system that is functional in 
Escherichia coli such as P BR322, the PUC series of vectors, the M13mp series 
of vectors, pBluescript etc. Typically these vectors contain a marker allowing 
20 the selection of transformed cells for example by conferring antibiotic resistance. 
Nucleic acid sequences may be introduced in these vectors and the vectors may 
be introduced in E. coli grown in an appropriate medium. Vectors may be 
recovered from cells upon harvesting and lysing the cells. 

Recombinant expression vectors suitable for the introduction of nucleic 
25 acid sequences in plant cells include Agrobacterium and Rhizobium based 

vectors such as the Ti and Ri plasmids. Agrobacterium based vectors typically 
carry at least one T-DNA border sequence and include vectors such pBIN 1 9 
(Bevan (1984) Nucl Acids Res. Vol. 12, 22:8711-8721) and other binary vector 
systems (for example: US Patent 4,940,838). 
30 Production of cells comprising a first and/or second recombinant polypeptides, 
multimeric-protein-complexes. heteromultimeric-protein-complexes, multimeric- 
fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins. 
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immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, and/or a first 
and/or second thioredoxin-related protein and oil-body-targeting-proteins 

In accordance with the present invention, the recombinant expression 
vectors are introduced into the cell that is selected and the selected cells are 
5 grown to produce the first and/or second recombinant polypeptides, multimeric- 
protein-complexes, heteromultimeric-protein-compiexes, multimeric-fusion- 
proteins, heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin- 
polypeptide-chains, redox-fusion-polypeptides, a first and/or second thioredoxin- 
related protein; and the oil-body-targeting-protein either directly or in a progeny 
10 cell. 

Methodologies to introduce recombinant expression vectors into a cell 
also referred to herein as "transformation" are well known to the art and vary 
depending on the cell type that is selected- General techniques to transfer the 
recombinant expression vectors into the cell include electroporation; chemically 

15 mediated techniques, for example CaCl2 mediated nucleic acid uptake; particle 
bombardment (biolistics); the use of naturally infective nucleic acid sequences 
for example virally derived nucleic acid sequences or when plant ceils are used 
Agrobacterium or Rhizobium derived nucleic acid sequences; PEG mediated 
nucleic acid uptake, microinjection, and the use of silicone carbide 'whiskers 

20 (Kaeppler et al. (1 990) Plant Cell Rep. 9:41 5-41 8) all of which may be used 
herein. 

Introduction of the recombinant expression vector into the cell may result 
in integration of its whole or partial uptake into host cell genome including the 
chromosomal DNA or the plastid genome. Alternatively the recombinant 

25 expression vector may not be integrated into the genome and replicate 

independently of the host cell's genomic DNA. Genomic integration of the 
nucleic acid sequence is typically used as it will allow for stable inheritance of 
the introduced nucleic acid sequences by subsequent generations of cells and 
the creation of cell, plant or animal lines. 

30 Particular embodiments involve the use of plant cells. Particular plant 

cells used herein include cells obtainable from Brazil nut [Betholletia exceJsa); 
castor (Riccinus communis)', coconut (Cocus nucifera); coriander (Coriandrum 
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sativum); cotton (Gossypium spp.); groundnut (Arachis hypogaea); jojoba 
(Simmondsia chinensis); linseed/flax (Unum usitatissimum); maize (Zea /nays); 
mustard (Brassica spp. and S/hap/s alba); oil palm (f/ae/s ou/nee/s); olive (Olea 
europaea); rapeseed Ma spp.); safflower {Carthamus tinctorius); soybean 
5 (Glycine max); squash [Cuct/rfwfa maxima); barley (Wo«/et//n w/fcare); wheat 
(Traeticum aestivum) and sunflower (Helianthus annuus). 

Transformation methodologies for dicotelydenous plant species are well 
known. Generally Agrobacterium mediated transformation is utilized because of 
its high efficiency as well as the general susceptibility by many, if not all 
10 dicotelydenous plant species. Agrobacterium transformation generally involves 
,the transfer of a binary vector (e.g. pBIN19) comprising the DNA of interest to 
an appropriate Agrobacterium strain (e.g. C1B542) by for example tri-parental 
mating with an E. coli strain carrying the recombinant binary vector and an E. 
co/i strain carrying a helper plasmid capable of mobilization of the binary vector 
1 5 to the target Agrobacterium strain, or by DNA transformation of the 

Agrobacterium strain (Hofgen et al. Nucl. Acids. Res. (1988) 16: 9877. Other 
transformation methodologies that may be used to transform dicotelydenous 
plant species include biolistics (Sanford (1988) Trends in Biotechn. 6: 299-302); 
electroporation (Fromm et al. (1985) Proc. Natl. Acad. Sci. USA 82: 5824- 
20 5828); PEG mediated DNA uptake (Potrykus et al. (1985) Mol. Gen. Genetics 
199: 169-177); microinjection (Reich et al. Bio/Techn. (1986) 4: 1001-1004) 
and silicone carbide whiskers (Kaeppler et al. (1990) Plant Cell Rep. 9: 41 5- 
418). The exact transformation methodologies typically vary somewhat 
depending on the plant species that is used. 
25 In a particular embodiment the oil bodies are obtained from safflower and 

the recombinant proteins are expressed in safflower. Safflower transformation 
has been described by Baker and Dyer (Plant Cell Rep. (1996) 16: 106-110). 

Monocotelydenous plant species may now also be transformed using a 
variety of methodologies including particle bombardment (Christou et al. (1991) 
30 Biotechn. 9: 957-962; Weeks et al. Plant Physiol. (1993) 102: 1077-1084; 

Gordon-Kamm et al. Plant Cell (1990) 2: 603-61 8) PEG mediated DNA uptake 
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(EP 0 292 435; 0 392 225) or Agrobacterium-mediated transformation (Goto- 
Fumiyuki et al (1999) Nature-Biotech. 17 (3):282-286). 

Plastid transformation is described in US Patents 5,451,513; 5,545,817 
and 5,545,818; and PCT Patent Applications 95/16783; 98/1 1235 and 
B 00/39313) Basic chloroplast transformation involves the introduction of cloned 
plastid DNA flanking a selectable marker together with the nucleic acid sequence 
of interest into a suitable target tissue using for example biolistics or protoplast 
transformation. Selectable markers that may be used include for example the 
bacterial aadA gene (Svab et al. (1993) Proc. Natl. Acad. Sci. USA 90: 913- 
10 917). Plastid promoters that may be used include for example the tobacco dpP 
gene promoter (PCT Patent Application 97/06250). 

In another embodiment, the invention chimeric nucleic ecid constructs 
provided herein are directly transformed into the plastid genome. Plastid 
transformation technology is described extensively in U.S. Patent Nos. 
15 5,451,513, 5,545,817, 5,545,818 and 5,576,198; in PCT application nos. W0 
95/16783 and WO 97/32977; and in McBride et. al., Proc Natl Acad Sci USA 
91: 7301-7305 (1994), the entire disclosures of all of which are hereby 
incorporated by reference. In one embodiment, plastid transformation is 
achieved via biolistics, first carried out in the unicellular green alga 
20 Ch/amydomonas reinhardtii (Boynton et al. (1 988) Science 240:1 534-1 537)) and 
then extended to Nicotiana tabacum (Svab et al. (1 990) Proc Natl Acad Sci USA 
87:8526-8530), combined with selection for cis-acting antibiotic resistance loci 
(spectinomycin or streptomycin resistance) or complementation of non- 
photosynthetic mutant phenotypes. 
25 In another embodiment, tobacco plastid transformation is carried out by 

particle bombardment of leaf or callus tissue, or polyethylene glycol (PEG)- 
mediated uptake of plasmid DNA by protoplasts, using cloned plastid DNA 
flanking a selectable antibiotic resistance marker. For example, 1 to 1 .5 kb 
flanking regions, termed targeting sequences, facilitate homologous 
30 recombination with the plastid genome and allow the replacement or 

modification of specific regions of the 156 kb tobacco plastid genome. In one 
embodiment, point mutations in the plastid 1 6S rDNA and rps12 genes 
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conferring resistance to spectinomycin and/or streptomycin can be utilized as 
selectable markers for transformation (Svab et a/. (1 990) Proc Natl Acad Sci 
USA 87:8526-8530; Staub era/. (1992) Plant Cell 4:39-45, the entire 
disclosures of which are hereby incorporated by reference), resulting in stable 
5 homoplasmic transformants at a frequency of approximately one per 100 

bombardments of target leaves. The presence of cloning sites between these 
markers allows creation of a plastid targeting vector for introduction of fore.gn 
genes (Staub era/- (1993) EMBO J 1 2:601 -606, the entire disclosure of which is 
hereby incorporated by reference). In another embodiment, substantial increases 
1 0 in transformation frequency can be obtained by replacement of the recessive 
rRNA or r-protein antibiotic resistance genes with a dominant selectable marker, 
the bacterial aadA gene encoding the spectinomycin-detoxifying enzyme 
aminoglycoside-3'-adeny.transferase (Svab et al. (1 993) Proc Natl Acad Sci USA 
90: 913-917, the entire disclosure of which is hereby incorporated by reference). 
This marker has also been used successfully for high-frequency transformation 
of the plastid genome of the green alga Chlamydomonas reinhardtii 
(Goldschmidt-C.ermont, M. d 991 1 Nucl Acids Res 19, 4083-4089, the entire 
disclosure of which is hereby incorporated by reference). In other embodiments, 
plastid transformation of protoplasts from tobacco and the moss Physcomitrella 
can be attained using PEG-mediated DNA uptake (O'Neill et al. (1 993) Plant J 
3:729-738; Koop et at. (1996) Pianta 1 99:1 93-201 , the entire disclosures of 
which are hereby incorporated by reference). 

Both particle bombardment and protoplast transformation are also 
contemplated for use herein. Plastid transformation of oilseed plants has been 
successfully carried out in the genera Arabidopsis and Brassica (Sikdar et al. 
(1998) Plant Cell Rep 18:20-24; PCT Application WO 00/39313, the entire 
disclosures of which are hereby incorporated by reference). 

A chimeric nucleic sequence construct is inserted into a plastid 
expression cassette including a promoter capable of expressing the construct in 
plant plastids. A particular promoter capable of expression in a plant plastid is, 
for example, a promoter isolated from the 5' flanking region upstream of the 
coding region of a plastid gene, which may come from the same or a different 
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species, and the native product of which is typically found in a majority of 
plastid types including those present in non-green tissues. Gene expression in 
plastids differs from nuclear gene expression and is related to gene expression in 
prokaryotes (Stern et at. (1997) Trends in Plant Sci 2:308-31 5, the entire 
5 disclosure of which is hereby incorporated by reference). 

Plastid promoters generally contain the -35 and -10 elements typical of 
prokaryotic promoters, and some plastid promoters called PEP (plastid-encoded 
RNA polymerase) promoters are recognized by an E eoff-like RNA polymerase 
mostly encoded in the plastid genome, while other plastid promoters called NEP 
10 promoters are recognized by a nuclear-encoded RNA polymerase. Both types of 
plastid promoters are suitable for use herein. Examples of plastid promoters 
include promoters of cl P P genes such as the tobacco cl P P gene promoter (WO 
97/06250, the entire disclosure of which is hereby incorporated by reference) 
and the Arabidopsis dpP gene promoter (U.S. Application No. 09/038,878, the 
15 entire disclosure of which is hereby incorporated by reference). Another 

promoter capable of driving expression of a chimeric nucleic acid construct in 
plant plastids comes from the regulatory region of the plastid 1 BS ribosomal RNA 
operon (Harris etal., (1994) Microbiol Rev 58:700-754; Shinozaki et at. (1986) 
EMBO J 5:2043-2049, the entire disclosures of both of which are hereby 
20 incorporated by reference). Other examples of promoters capable of driving 

expression of a nucleic acid construct in plant plastids include a psbA promoter 
or am rbcL promoter. A plastid expression cassette preferably further includes a 
plastid gene 3' untranslated sequence (3' UTR) operatively linked to a chimeric 
nucleic acid construct of the present invention. The role of untranslated 
sequences is preferably to direct the 3' processing of the transcribed RNA rather 
than termination of transcription. An exemplary 3' UTR is a plastid r P s16 gene 
3' untranslated sequence, or the Arabidopsis plastid psbA gene 3' untranslated 
sequence. In a further embodiment, a plastid expression cassette includes a 
poly-G tract instead of a 3' untranslated sequence. A plastid expression 
cassette also preferably further includes a 5' untranslated sequence (5' UTR) 
functional in plant plastids, operatively linked to a chimeric nucleic acid construct 
provided herein. 
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A plastid expression cassette is contained in a plastid transformation 
vector, which preferably further includes flanking regions for integration into the 
plastid genome by homologous recombination. The plastid transformation vector 
may optionally include at least one plastid origin of replication. The present 
5 invention also encompasses a plant plastid transformed with such a plastid 

transformation vector, wherein the chimeric nucleic acid construct is expressible 
in the plant plastid. Also encompassed herein is a plant or plant cell, including 
the progeny thereof, including this plant plastid. In a particular embodiment, the 
plant or plant cell, including the progeny thereof, is homoplasmic for transgenic 
10 plastids. 

Other promoters capable of driving expression of a chimeric nucleic acid 
construct in plant plastids include transactivator-regulated promoters, preferably 
heterologous with respect to the plant or to the subcellular organelle or 
component of the plant cell in which expression is effected. In these cases, the 
1 5 DNA molecule encoding the transactivator is inserted into an appropriate nuclear 
expression cassette which is transformed into the plant nuclear DNA. The 
transactivator is targeted to plastids using a plastid transit peptide. The 
transactivator and the transactivator-driven DNA molecule are brought together 
either by crossing a selected plastid-transf ormed line with and a transgenic line 
20 containing a DNA molecule encoding the transactivator supplemented with a 
plastid T targeting sequence and operably linked to a nuclear promoter, or by 
directly transforming a plastid transformation vector containing the desired DNA 
molecule into a transgenic line containing a chimeric nucleic acid construct 
encoding the transactivator supplemented with a plastid-targeting sequence 
25 operably linked to a nuclear promoter. If the nuclear promoter is an inducible 
promoter, in particular a chemically inducible embodiment, expression of the 
chimeric nucleic acid construct in the plastids of plants is activated by foliar 
application of a chemical inducer. Such an inducible transactivator-mediated 
plastid expression system is preferably tightly regulable, with no detectable 
30 expression prior to induction and exceptionally high expression and accumulation 
of protein following induction. 
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A particular transactivator is, for example, viral RNA polymerase. 
Particular promoters of this type are promoters recognized by a single sub-unit 
RNA polymerase, such as the T7 gene 10 promoter, which is recognized by the 
bacteriophage T7 DNA-dependent RNA polymerase. The gene encoding the T7 
5 polymerase is preferably transformed into the nuclear genome and the T7 

polymerase is targeted to the plastids using a plastid transit peptide. Promoters 
suitable for nuclear expression of a gene, for example a gene encoding a viral 
RNA polymerase such as the T7 polymerase, are described above and elsewhere 
in this application. Expression of chimeric nucleic acid constructs in plastids can 
10 be constitutive or can be inducible, and such plastid expression can be also 
organ- or tissue-specific. Examples of various expression systems are 
extensively described in WO 98/1 1235, the entire disclosure of which is hereby 
, incorporated by reference. Thus, in one aspect, the present invention utilizes 
coupled expression in the nuclear genome of a chloroplast-targeted phage T7 
1 5 RNA polymerase under the control of the chemically inducible PR-1 a promoter, 
for example of the PR-1 promoter of tobacco, operably linked with, a chloroplast 
reporter transgene regulated by T7 gene 10 promoter/terminator sequences, for 
example as described in as in US Patent No. 5,614,395 the entire disclosure of 
which is hereby incorporated by reference. In another embodiment, when 
20 plastid transformants homoplasmic for the maternally inherited TR or NTR genes 
are pollinated by lines expressing the T7 polymerase in the nucleus, F1 plants 
are obtained that carry both transgene constructs but do not express them until 
synthesis of large amounts of enzymatically active protein in the plastids is 
triggered by foliar application of the PR-1 a inducer compound 
25 benzo(1,2,3)thiadiazole-7-carbothioic acid S-methyl ester (BTH). 

In a particular embodiment, two or more genes, for example TR and NTR 
genes, are transcribed from the plastid genome from a single promoter in an 
operon-like polycistronic gene. In one embodiment, the operon-like polycistronic 
gene includes an intervening DNA sequence between two genes in the operon- 
30 like polycistronic gene. In a particular embodiment, the intervening DNA 
sequence is not present in the plastid genome to avoid homologous 
recombination with plastid sequences. In another embodiment, the DNA 
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sequence is derived from the 5 ' untranslated (UTR) region of a non-eukaryotic 
gene, preferably from a viral 5 'UTR, preferably from a 5 'UTR derived from a 
bacterial phage, such as a T7, T3 or SP6 phage. In one embodiment, a portion 
of the DNA sequence may be modified to prevent the formation of RNA 
5 secondary structures in an RNA transcript of the operon-like polycistronic gene, 
for example between the DNA sequence and the RBS of the downstream gene. 
Such secondary structures may inhibit or repress the expression of the 
downstream gene, particularly the initiation of translation. Such RNA secondary 
structures are predicted by determining their melting temperatures using 
10 computer models and programs such a the "mfold" program version 3 (available 
from Zuker and Turner, Washington University School of Medicine, St-Louis, 
MO) and other methods known to one skilled in the art. 

The presence of the intervening DNA sequence in the operon-like 
polycistronic gene increases the accessibility of the RBS of the downstream 
15 gene, thus resulting in higher rates of expression. Such strategy is applicable to 
any two or more genes to be transcribed from the plastid genome from a single 
promoter in an operon-like chimeric heteromultimeric gene. 

Following transformation the cells are grown, typically in a selective 
medium allowing the identification of transformants. Cells may be harvested in 
20 accordance with methodologies known to the art. In order to associate the oil 

bodies with the first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, and a first and/or second thioredoxin-related 
25 protein, the integrity of cells may be disrupted using any physical, chemical or 
biological methodology capable of disrupting the cells' integrity. These 
methodologies are generally cell-type dependent and known to the skilled 
artisan. Where plants are employed they may be regenerated into mature plants 
using plant tissue culture techniques generally known to the skilled artisan. 
30 Seeds may be harvested from mature transformed plants and used to propagate 
the plant line. Plants may also be crossed and in this manner, contemplated 
herein is the breeding of cells lines and transgenic plants that vary in genetic 
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background. It is also possible to cross a plant line comprising the first 
recombinant polypeptide with a plant line comprising the second recombinant 
polypeptide. Accordingly, also provided herein are methods of producing in a 
plant a recombinant multimeric-protein-complex, said method comprising: 
5 (a) preparing a first plant comprising cells, said cells comprising oil bodies and a 
first recombinant polypeptide, such as a redox protein (e.g., a thioredoxin-related 
protein, and the like) or an immunoglobulin-polypeptide-chain, wherein said first 
recombinant polypeptide is capable of associating with said oil bodies through an 

oil-body-targeting-protein; 
10 (b) preparing a second plant comprising cells, said cells comprising oil bodies and 

a second recombinant polypeptide, such as a second redox protein (e.g., a 

thioredoxin-related protein, and the like) or a second immunoglobulin- 

polypeptide-chain; and 

(c) sexually crossing said first plant with said second plant to produce a progeny 
15 plant comprising cells, said cells comprising oil bodies, wherein said oil bodies 
are capable of associating with said first recombinant polypeptide, and said first 
recombinant recombinant polypeptide is capable of associating with said second 
recombinant polypeptide to form said recombinant multimeric-protein-complex. 
The second recombinant polypeptide may also associate with the oil 
20 bodies. Accordingly, also provided herein are methods of producing in a plant a 
recombinant multimeric-protein-complex, said method comprising: 

(a) preparing a first plant comprising cells, said cells comprising oil bodies and a 
first recombinant polypeptide, such as a redox (or thioredoxin-related) protein or 
immunoglobulin-polypeptide-chain, wherein said first recombinant polypeptide is 

25 capable of associating with said oil bodies through an oil-body-targeting-protein; 

(b) preparing a second plant comprising cells, said cells comprising oil bodies and 
a second recombinant polypeptide, such as a second redox (thioredoxin-related) 
protein or a second immunoglobulin-polypeptide-chain, wherein said second 
recombinant polypeptide is capable of associating with said oil bodies through an 

30 oil body targeting protein; and 

(c) sexually crossing said first plant with said second plant to produce a progeny 
plant comprising cells, said cells comprising oil bodies, wherein said oil bodies 
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are capable of associating with said first recombinant polypeptide, and said first 
recombinant recombinant polypeptide is capable of associating with said second 
recombinant polypeptide to form said recombinant multimeric-protein-complex. 

Isolation of Oil bodies 
5 The oil bodies provided herein may be obtained from any cell containing 

oil bodies, including any animal cell; plant cell; fungal cell; for example a yeast 
cell, algae cell; or bacterial cell. Any process suitable for the isolation oil bodies 
from cells may be used herein. Processes for the isolation of oil bodies from 
plant seed cells have been described in US Patents (6,146,645 and 6,183,762) 
10 and the isolation of oil bodies from yeast cells has been described by Ting et al. 
(1997) J. Biol. Chem. 272: 3699-3706). 

In certain embodiments, the oil bodies are obtained from a plant cell such 
as for example a pollen cell; a fruit cell; a spore cell; a nut cell; mesocarp cell; for 
example the mesocarp cells obtainable from olive (O/ea europaea) or avocado 
15 (Persea americana); or a seed cell. In particular embodiments the oil bodies are 
obtained from a plant seed cell. The seeds can be obtained from a transgenic 
plant according to the present invention. In particular embodiments, a seed of a 
transgenic plant according to the present invention contains the first and/or 
second recombinant polypeptides, multimeric-protein-complexes, 
20 heteromultimeric-protein-complexes, multimeric-fusion-proteins, 

heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
chains, redox-fusion-polypeptides, or first and/or second thioredoxin-related 
proteins in a concentration of at least about 0.5% of total cellular seed protein. 
In further embodiments, a seed of a transgenic plant provided herein contains a 
25 recombinant polypeptide or multimeric-protein-complex in a concentration of at 
least about 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, 1.0%, 1.25%, 1.5%, 1.75%. 
2.0%, 2.25%. 2.5%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10% or more, of total 
cellular seed protein. The upper limits of the recombinant polypeptide or 
multimeric-protein-complex concentration can be up to about 8%, 9%. 10%. 
30 1 1 %, 12%, 13%, 14%, 15%. Thus, the ranges at least about 0.5% up to 

about 15%; at least about 1.0% up to about 10%; and at least about 5% up to 
about 8% are among the various ranges contemplated herein. 
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Among the plant seeds useful in this regard are plant seeds obtainable 
from the group of plant species consisting of Brazil nut (Betholletia excelsa); 
castor [Rfccinus communis): coconut (Cocus nucifera); coriander (Coriandrum 
sativum): cotton (Gossypium spp.); groundnut (Arachis hypogaea); jojoba 
5 (Simmondsia chinensis); linseed/flax (Linum usitatissimum); maize (Zea mays): 
mustard [Brassica spp. and Sinapis alba); oil palm [E/aeis guineeis); olive (Olea 
europaea); rapeseed (Brassica spp.); safflower (Carthamus tinctorius): soybean 
(Glycine max); squash (Cucurbita maxima): sunflower (Helianthus annuus); 
barley (Hordeum vulgare); wheat [Traeticum aestivum) and mixtures thereof. In 
10 a particular embodiment, oil bodies are obtainable from the seeds obtainable 
from safflower {Carthamus tinctorius). 

In order to prepare oil bodies from plant seeds, plants are grown and 
allowed to set seed in accordance with common agricultural practices. Thus, 
the present invention also provides seeds comprising oil bodies, wherein said oil 
15 bodies further comprise invention multimeric-protein-complexes described herein. 
Upon harvesting the seed and, if necessary the removal of large insoluble 
materials such as stones or seed hulls, by for example sieving or rinsing, any 
process suitable for the isolation of oil bodies from seeds may be used herein. A 
typical process involves grinding of the seeds followed by an aqueous extraction 
20 process. 

Seed grinding may be accomplished by any comminuting process 
resulting in a substantial disruption of the seed cell membrane and cell walls 
without compromising the structural Integrity of the oil bodies present in the 
seed cell. Suitable grinding processes in this regard include mechanical pressing 

25 and milling of the seed. Wet milling processes such as described for cotton 
(Lawhon et al. (1977) J. Am. Oil Chem. Soc. 63: 533-534) and soybean (US 
Patent 3,971,856; Carter et al. (1974) J. Am. Oil Chem. Soc. 51: 137-141) are 
particularly useful in this regard. Suitable milling equipment capable of industrial 
scale seed milling include colloid mills, disc mills, pin mills, orbital mills, IKA mills 

30 and industrial scale homogenizers. The selection of the milling equipment will 
depend on the seed, which is selected, as well as the throughput requirement. 



RECTIFIED SHEET (RULE 91) 



WO 02/50289 



PCT/US01/50240 



-63- 



Solid contaminants such as seed hulls, fibrous materials, undissolved 
carbohydrates, proteins and other insoluble contaminants are subsequently 
preferably removed from the ground seed fraction using size exclusion based 
methodologies such as filtering or gravitational based methods such as a 
5 centrifugation based separation process. Centrifugation may be accomplished 
using for example a decantation centrifuge such as a HASCO 200 2-phase 
decantation centrifuge or an NX310B (Alpha Laval). Operating conditions are 
selected such that a substantial portion of the insoluble contaminants and 
sediments and may be separated from the soluble fraction. 
10 Following the removal of insolubles the oil body fraction may be 

separated from the aqueous fraction. Gravitational based methods as well as 
size exclusion based technologies may be used. Gravitational based methods 
that may be used include centrifugation using for example a tubular bowl 
centrifuge such as a Sharpies AS-16 or AS-46 (Alpha Laval), a disc stack 
15 centrifuge or a hydrocyclone, or separation of the phases under natural 

gravitation. Size exclusion methodologies that may be used include membrane 
ultra filtration and crossflow microfiltration. 

Separation of solids and separation of the oil body phase from the 
aqueous phase may also be carried out concomitantly using gravity based 
20 separation methods or size exclusion based methods. 

The oil body preparations obtained at this stage in the process are 
generally relatively crude and depending on the application of the oil bodies, it 
may be desirable to remove additional contaminants. Any process capable of 
removing additional seed contaminants may be used in this regard. Conveniently 
25 the removal of these contaminants from the oil body preparation may be 

accomplished by resuspending the oil body preparation in an aqueous phase and 
re-centrifuging the resuspended fraction, a process referred to herein as 
"washing the oil bodies". The washing conditions selected may vary depending 
on the desired purity of the oil body fractions. For example where oil bodies are 
30 used in pharmaceutical compositions, generally a higher degree of purity may be 
desirable than when the oil bodies are used in food preparations. The oil bodies 
may be washed one or more times depending on the desired purity and the ionic 
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strength,.pH and temperature may all be varied. Analytical techniques may be 
used to monitor the removal of contaminants. For example SDS gel 
electrophoresis may be employed to monitor the removal of seed proteins. 

The entire oil body isolation process may be performed in a batch wise 
5 fashion or continuous flow. In a particular embodiment, industrial scale 
continuous flow processes are utilized. 

Through the application of these and similar techniques the skilled artisan 
is able to obtain oil bodies from any cell comprising oil bodies. The skilled 
artisan will recognize that generally the process will vary somewhat depending 
10 on the cell type that is selected. However, such variations may be made 
without departing from the scope and spirit of the present invention. 
Association of the first and/or second recombinant polypeptides, multimeric- 
protein-complexes, heteromultimeric-protein-complexes, multimeric-fusion- 
proteins, heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin- 
1 5 polypeptide-chains, redox-fusion-polypeptides, the first and/or second 
. thioredoxin-related proteins with oil bodies. 

In accordance with the present invention, the oil bodies are associated 
with either the first and/or second recombinant polypeptides, multimeric-protein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
20 heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypepf.de- 
chains, redox-fusion-polypeptides, the first and/or second thioredoxin-related 
proteins through association with an oil-body-targeting-protein capable of 
association with these multimeric-protein-complexes and the oil bodies. As used 
herein the phrase "associating the oil bodies with the multimeric-protein- 
25 complex" means that the oil bodies are brought in proximity of the multimeric- 
protein-complexes in a manner that allows the association of the oil bodies with 
either the first and/or second recombinant polypeptides, multimeric-prptein- 
complexes, heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
30 chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 
proteins. The association of the oil bodies with the multimeric-protein- 
complexes is accomplished by association of the oil-body-targeting-protein with 
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both the oil body and with the multimeric-protein-complex. In particular 
embodiments, the cells expressing the multimeric-protein-complex associate with 
the oil bodies that are obtainable from these same cells, which permits the 
convenient production and isolation of the multimeric-protein-complex, including 
5 the first and/or second recombinant polypeptides, heteromultimeric-protein- 
complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins, in an oil body-comprising 
host cell system. Accordingly, in one embodiment, the association of the oil 
10 body with the multimeric-protein-complex is accomplished intracellular^ during 
the growth of the cell. For example, a redox fusion polypeptide may be fused to 
an oil-body-protein and the chimeric protein may be expressed in oil body- 
containing plant seeds. Isolation of the oil bodies from the seeds in this case 
results in isolation of oil bodies comprising either the first and/or second 
15 recombinant polypeptides, multimeric-protein-complexes, heteromultimeric- 

protein-complexes, multimeric-fusion-proteins, heteromultimeric-fusion-proteins, 
immunoglobulins, immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, 
or the first and/or second thioredoxin-related proteins. In another embodiment, 
in which the multimeric-protein-complex associates with oil bodies obtainable 
20 from the same cells in which the complex is produced, the association of the oil 
bodies with the multimeric-protein-complex is accomplished upon disrupting the 
cell's integrity. 

For example, the first and/or second recombinant polypeptides, 
multimeric-protein-complexes, heteromultimeric-protein-complexes, multimeric- 

25 fusion-proteins, heteromultimeric-fusion-proteins, immunoglobulins, 

immunoglobulin-polypeptide-chains, redox-fusion-polypeptides, or the first and/or 
second thioredoxin-related proteins may be expressed in such a manner that it is 
targeted to the endomembrane system of the seed cells. Oil bodies present in 
the same seed cells comprising an oil-body-targeting-protein capable of 

30 association with these multimeric-protein-complexes, for example an oleosin 
linked to a single chain antibody capable of association with a recombinant 
polypeptide or multimeric-protein-complex, may then associate with the 
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recombinant polypeptide or multimeric-protein-complex upon grinding of the 
seed. 

In accordance with this embodiment, plant seed cells comprising a light 
and heavy chain of an immunoglobulin targeted to the plant apoplast can be 
prepared. These particular seed cells are prepared to further comprise oil bodies 
associated with an oil-body-targeting-protein capable of association with the 
immunoglobulin, such as for example, an oleosin-protein A fusion protein, and 
the like. Upon grinding of the seed, the oil bodies comprising protein A 
associate with the immunoglobulin through binding. 

In yet another embodiment, the oil bodies used to associate with the 
multimeric-protein-complex are obtained from a cellular source different from the 
cell comprising the first and/or second recombinant polypeptides, multimeric- 
protein-complexes, heteromultimeric-protein-complexes, multimeric-fusion- 
proteins, heteromultimeric-fusion-proteins. immunoglobulins, immunoglobulin- 
15 polypeptide-chains, redox-fusion-polypeptides, or the first and/or second 

thioredoxin-related proteins, such as from a separate plant line. For example, oil 
bodies associated with protein A may be prepared from one plant line. These oil 
bodies may then be mixed with ground seeds comprising an apoplastically 
expressed light and heavy chain constituting an immunoglobulin. Alternatively, a 
20 plant line comprising oil bodies associated with protein A may be crossed with a 
plant line comprising an immunoglobulin. 

The first recombinant polypeptide, second recombinant polypeptide and 
oil-body-targeting-protein may also be prepared in separate cellular 
compartments. Association of the first polypeptide, second polypeptide, and oil 
25 body then may occur upon disruption of the cell's integrity. For example, 

various mechanisms for targeting gene products are known to exist in plants, 
and the sequences controlling the functioning of these mechanisms have been 
characterized in some detail. For example, the targeting of gene products to the 
chloroplast is controlled by a transit sequence found at the amino terminal end of 
30 various proteins which is cleaved during chloroplast import to yield the mature 
protein (Comai et el. (1988) J Biol Chem 263: 15104-15109). Other gene 
products are localized to other organelles such as the mitochondrion and the 
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peroxisome (Unger eta/. (1989) Plant Mot Biol 13:411-418). The cDNAs 
encoding these products can be manipulated to target heterologous gene 
products to these organelles. In addition, sequences have been characterized 
which cause the targeting of gene products to other cell compartments. 
5 Amino terminal sequences are responsible for targeting to the ER, the 

apoplast, and extracellular secretion from aleurone cells (Koehler & Ho (1990) 
Plant Cell 2:7 69-7 B3). Additionally, amino terminal sequences in conjunction 
with carboxy terminal sequences are responsible for vacuolar targeting of gene 

products (Shinshi at al., (1 990) Plant Mol Biol 1 4:357-368). By the fusion of the 
10 appropriate targeting sequences described above to transgene sequences of 

interest it is possible to direct the transgene product to the desired organelle or 

cell compartment. 

As hereinbefore mentioned, the redox protein obtained using the methods 
provided herein is enzymatically active while associated with the oil body. 

1 5 Preferably the redox protein is at least 5 times more active when produced as a 
redox fusion polypeptide with a second redox protein relative to its production in 
association with an oil body as a non-fusion polypeptide (i.e. without the second 
redox protein). More preferably the redox protein is at least 10 times more 
active when produced as a redox fusion polypeptide. 

20 The activity of the redox fusion polypeptide may be determined in 

accordance with methodologies generally known to the art (see for example: 
Johnson et al (1984) J. of Bact. Vol. 158 3:1061-1069) and may be optimized 
by for example the addition of detergents, including ionic and non-ionic 

detergents. 
25 Formulation of Oil Bodies 

In accordance with a particular embodiment, the oil bodies comprising the 
first and/or second recombinant polypeptides, multimeric-protein-complexes, 
heteromultimeric-protein-complexes, multimeric-fusion-proteins, 
heteromultimeric-fusion-proteins, immunoglobulins, immunoglobulin-polypeptide- 
30 chains, redox-fusion-polypeptides, or the first and/or second thioredoxin-related 
proteins, are preferably formulated into an emulsion. The emulsion is preferably 
used in the preparation of a pharmaceutical composition, personal care or a food 
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product. In emulsified form, the oil body offers certain desirable properties, such 
as for example excellent compatibility with the human skin. 

It particular embodiments, the oil body formulation is stabilized so that a 
final product may be obtained which may be stored and preserved for longer 
periods of time. As used herein, the term "stabilized oil body preparation" refers 
to an oil body preparation that is prepared so that the formulation does not 
undergo undesirable physical or chemical alterations when the oil body 
preparation is stored. The stabilization requirements may vary depending on the 
final product. For example personal care products are preferably stable for at 
least one year at room temperature while additionally being able to withstand 
short temperature fluctuations. Pharmaceutical formulations may in some cases 
be less stable as they may be stored at lower temperatures thereby preventing 
the occurrence of undesirable reactions. 

In general, stabilization techniques that may be used herein include any 
and all methods for the preservation of biological material including the addition 
of chemical agents, temperature modulation based methodologies, radiation- 
. based technologies and combinations thereof. In particular embodiments small 
amounts of stabilizing chemical agents are mixed, with the oil body formulation 
to achieve stabilization. These chemical agents include inter alia preservatives, 
20 antioxidants, acids, salts, bases, viscosity modifying agents, emulsifiers, gelling 
agents and mixtures thereof and may all be used to stabilize the oil body 
preparation. In view of the presence of the redox fusion polypeptide the 
stabilizing agent is generally selected to be compatible with and resulting in good 
enzymatic function of the redox fusion polypeptide. 

Diagnostic parameters to assess the stability of the oil body preparation 
may be as desired and include all parameters indicative of undesirable qualitative 
or quantitative changes with respect to chemical or physical stability. Typical 
parameters to assess the oil body preparation over time include color, odor, 
viscosity, texture, P H and microbial growth, and enzymatic activity. 
30 In particular embodiments, the oil body formulation is stabilized prior to 

the addition of further ingredients that may be used to prepare the final product. 
Howevera, in other embodiments, it is nevertheless possible to formulate the 
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final formulation using non-stabilized oil bodies and stabilize the final formulation. 

The final preparations may be obtained using one or more additional 
ingredients and any formulation process suitable for the preparation of a 
formulation comprising oil bodies. Ingredients and processes employed will 
5 generally vary depending on the desired use of the final product, will be art 

recognized and may be as desired. Ingredients and processes that may be used 
herein include those described in US Patents (US Patents 6,146,645 and 
6,183,762) which are incorporated by reference herein. 

In particular embodiments, the redox fusion polypeptide comprises a 
10 thioredoxin and a thioredoxin-reductase. Accordingly, provided herein are oil 
bodies comprising a thioredoxin/thioredoxin-reductase fusion polypeptide. Also 
provided herein is a formulation containing oil bodies comprising a 
thioredoxin/thioredoxin-reductase fusion capable of treating or protecting a target 
against oxidative stress. The stress of the target is treated or prevented by 
1 5 contacting the target with the formulation. The target may be any substance 
susceptible to oxidative stress, including any molecule, molecular complex, cell, 
tissue or organ. 

In another embodiment, provided herein is a formulation containing oil 
bodies comprising a thioredoxin/thioredoxin-reductase fusion capable of 
20 chemically reducing a target. Contacting the target with the formulation reduces 
the target. The target may be any substance susceptible to reduction, including 
any molecule or molecular complex. Particularly susceptible targets in this 
regard are the disulfide bonds present in proteins. 

The oil bodies comprising thioredoxin/thioredoxin-reductase may be used 
25 to prepare formulations used to reduce the allergenic^ of food or increase.the 
digestibility of food. Preferably, the method of reducing the food allergenic^ is 
practiced by mixing the thioredoxin/thioredoxin-reductase comprising oil bodies 
with food or food ingredients selected from a variety of sources including for 
example wheat flour, wheat dough, milk, cheese, soya, yogurt and ice cream. 
30 The thioredoxin/thioredoxin-reductase comprising oil bodies may also be used to 
increase the digestibility of milk as well as other disulfide containing proteins 
(Jiao, J. et al. (1992) J. Agric. Food Chem 40: 2333-2336). Further food 
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applications include the use of the oil thioredoxin/thioredoxin-reductase 
comprising oil bodies as a food additive to enhance dough strength and bread 
quality properties (Wong et al., (1993) J. Cereal Chem. 70: 113-114; Kobrehel 
et al. (1994) Gluten Proteins: Association of Cereal Research; Detmold, 
5 Germany). 

Also provided herein are pharmaceutical compositions comprising, in a 
pharmaceutical^ active carrier: oil bodies comprising a thioredoxin/thioredoxin- 
reductase; oil bodies comprising multimeric-protein-complexes. such as 
. heteromultimeric-protein-complexes; isolated thioredoxin/thioredoxin-reductase 
10 fusion proteins; or isolated multimeric-protein-complexes. These pharmaceutical 
compositions may be used for the treatment of reperfusion injury (Aota et al. 
(1996) J. Cardiov. Pharmacol. (1996) 27: 727-732), cataracts (US Patent US 
4,771,036), chronic obstructive pulmonary disease (COPD) (MacNee et al. 
(1999) Am. J. Respir. Crit. Care Med. 1 60:S58-S65), diabetes (Hotta et al. J. 
15 Exp. Med. 188: 1445-1451), envenomation (PCT Patent Application 99/201 22; 
US Patent 5,792,506), bronchopulmonary disease (MacNee (2000) Chest 
117:3035-3175); malignancies (PCT Patent Application 91/04320) and the 
alleviation of the allergenic potential of airborne, for example pollen- derived, and 
contact allergens (PCT Patent Application 00/44781). Other diseases or 
20 conditions that may be treated with the pharmaceutical compositions provided 
herein include: psoriasis, wound healing, sepsis, Gl bleeding, intestinal bowel 
disease (IBD), ulcers, transplantation, GERD (gastro esophageal reflux disease). 

In another embodiment, the pharmaceutical compositions provided herein, 
particularly those comprising one or more redox proteins alone or in combination 
25 with oil bodies, can be used in the treatment of inflammatory and viral diseases 
by reductively inactivating phospholipase A2, one of the contributing factors in 
inflammatory diseases. Additionally, the redox fusion polypeptide system has 
been found to function as a self-defense mechanism in response to 
environmental stimuli, including oxidative stress caused by UV-generated free 
30 radicals. Consequently, redox proteins, e.g., oleosin-thioredoxin, oleosin- 

thioredoxin-reductase, the various redox fusion polypeptides described herein, 
provide beneficial effects in certain skin conditions such as psoriasis, skin 
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cancer. dandruff, diaper rash, dermatitis, acne, sun damage, aging, 

inflammation, and the like. 

In another embodiment, oil-body-thioredoxin-related fusion proteins, e.g., 

oleosin-Thioredoxin-reductase, can also be used as a venom antidote. Many 
5 animal venoms and other toxins contain disulfide bonds, including all snake 

venom neurotoxins, some bacterial neurotoxins including tetanus and botulinum 
A, bee venom phospholipase A 2 , and scorpion venom. In a further embodiment, 
the redox protein related pharmaceutical compositions provided herein can be 
used to inactivate venom toxins by reduction of disulfide bonds. A method of 
10 treating an individual suffering from the effects of a venom or toxin can include 
the step of administering an effective dose of a pharmaceutical composition, in a 
pharmaceutical effective carrier in an amount sufficient to relieve or reverse the 
effects of the venom toxin on the individual. 

The pharmaceutical compositions provided herein are preferably 
15 formulated for single dosage administration. The concentrations of the 

compounds in the formulations are effective for delivery of an amount, upon 
administration, that is effective for the intended treatment. Typically, the 
compositions are formulated for single dosage administration. To formulate a 
composition, the weight fraction of a compound or mixture thereof is dissolved, 
20 suspended, dispersed or otherwise mixed in a selected vehicle at an effective 
concentration such that the treated condition is relieved or ameliorated. 
Pharmaceutical carriers or vehicles suitable for administration of the compounds 
provided herein include any such carriers known to those skilled in the art to be 
suitable for the particular mode of administration. 
25 In addition, the compounds may be formulated as the sole 

pharmaceutical^ active ingredient in the composition or may be combined with 
other active ingredients. Liposomal suspensions, including tissue-targeted 
liposomes, may also be suitable as pharmaceutical^ acceptable carriers. These 
may be prepared according to methods known to those skilled in the art. For 
30 example, liposome formulations may be prepared as described in U.S. Patent No. 
4,522,811. 
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The active compound is included in the pharmaceutical^ acceptable 
carrier in an amount sufficient to exert a therapeutically useful effect in the 
absence of undesirable side effects on the patient treated. The therapeutically 
effective concentration may be determined empirically by testing the compounds 
5 in known in vitro and in vivo systems, such as the assays provided herein. 

The concentration of active compound in the drug composition will 
depend on absorption, inactivation and excretion rates of the active compound, 
the physicochemical characteristics of the compound, the dosage schedule, and 
amount administered as well as other factors known to those of skill in the art. 
10 Typically a therapeutically effective dosage is contemplated. The 

amounts administered may be on the order of 0.001 to 1 mg/ml, preferably 
about 0.005-0.05 mg/ml, more preferably about 0.01 mg/ml, of blood volume. 
Pharmaceutical dosage unit forms are prepared to provide from about 1 mg to 
about 1000 mg and preferably from about 10 to about 500 mg, more preferably 
15 about 25-75 mg of the essential active ingredient or a combination of essential 
ingredients per dosage unit form. The precise dosage can be empirically 
determined. 

The active ingredient may be administered at once, or may be divided into 
a number of smaller doses to be administered at intervals of time. It is 
20 understood that the precise dosage and duration of treatment is a function of the 
disease being treated and may be determined empirically using known testing 
protocols or by extrapolation from in yiyo or in yjtro test data. It is to be noted 
that concentrations and dosage values may also vary with the severity of the 
condition to be alleviated. It is to be further understood that for any particular 
25 subject, specific dosage regimens should be adjusted over time according to the 
individual need and the professional judgment of the person administering or 
supervising the administration of the compositions, and that the concentration 
ranges set forth herein are exemplary only and are not intended to limit the 
scope or use of the claimed compositions and combinations containing them. 
30 ' Preferred pharmaceutical^ acceptable derivatives include acids, salts, 

esters, hydrates, solvates and prodrug forms. The derivative is typically selected 
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such that its pharmacokinetic properties are superior to the corresponding 

neutral compound. 

Thus, effective concentrations or amounts of one or more of the 
compounds provided herein or pharmaceutical^ acceptable derivatives thereof 
5 are mixed with a suitable pharmaceutical carrier or vehicle for systemic, topical 
or local administration to form pharmaceutical compositions. Compounds are 
included in an amount effective for ameliorating or treating the disorder for 
which treatment is contemplated. The concentration of active compound in the 
composition will depend on absorption, inactivation, excretion rates of the active 
10 compound, the dosage schedule, amount administered, particular formulation as 
well as other factors known to those of skill in the art. 

Solutions or suspensions used for parenteral, intradermal, subcutaneous, 
or topical application can include any of the following components: a sterile 
diluent, such as water for injection, saline solution, fixed oil, polyethylene glycol, 
15 glycerine, propylene glycol or other synthetic solvent; antimicrobial agents, such 
as benzyl alcohol and methyl parabens; antioxidants, such as ascorbic acid and 
sodium bisulfite; chelating agents, such as ethylenediaminetetraacetic acid 
(EDTA); buffers, such as acetates, citrates and phosphates; and agents for the 
adjustment of tonicity such as sodium chloride or dextrose. Parenteral 
20 preparations can be enclosed in ampules, disposable syringes or single or 
multiple dose vials made of glass, plastic or other suitable material. 

In instances in which the compounds exhibit insufficient solubility, 
methods for solubilizing compounds may be used. Such methods are known to 
those of skill in this art, and include, but are not limited to, using cosolvents, 
25 such as dimethylsulfox.de (DMSO), using surfactants, such as Tween®, or 

dissolution in aqueous sodium bicarbonate. Derivatives of the compounds, such 
as prodrugs of the compounds may also be used in formulating effective 
pharmaceutical compositions. For ophthalmic indications, the compositions are 
formulated in an ophthalmically acceptable carrier. For the ophthalmic uses 
30 herein, local administration, either by topical administration or by injection is 
preferred. Time release formulations are also desirable. Typically, the 
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compositions are formulated for single dosage administration, so that a single 
dose administers an effective amount. 

Upon mixing or addition of the compound with the vehicle, the resulting 
mixture may be a solution, suspension, emulsion or other composition. The form 
5 of the resulting mixture depends upon a number of factors, including the 
intended mode of administration and the solubility of the compound in the 
selected carrier or vehicle. If necessary, pharmaceutical^ acceptable salts or 
other derivatives of the compounds are prepared. 

The compound is included in the pharmaceutical^ acceptable carrier in an 
10 amount sufficient to exert a therapeutically useful effect in the absence of 

undesirable side effects on the patient treated. It is understood that number and 
degree of side effects depends upon the condition for which the compounds are 
■ administered. For example, certain toxic and undesirable side effects are 
tolerated when treating life-threatening illnesses that would not be tolerated 
15 when treating disorders of lesser consequence. 

The compounds can also be mixed with other active materials, that do 
not impair the desired action, or with materials that supplement the desired 
action known to those of skill in the art. The formulations of the compounds 
. and agents for use herein include those suitable for oral, rectal, topical, 
20 inhalational, buccal [e.g., sublingual), parenteral (e.g.. subcutaneous, 

intramuscular, intradermal, or intravenous), transdermal administration or any 
route. The most suitable route in any given case will depend on the nature and 
severity of the condition being treated and on the nature of the particular active 
compound which is being used. The formulations are provided for administration 
25 to humans and animals in unit dosage forms, such as tablets, capsules, pills, 

powders, granules, sterile parenteral solutions or suspensions, and oral solutions 
or suspensions, and oil-water emulsions containing suitable quantities of the 
compounds or pharmaceutical^ acceptable derivatives thereof. The 
pharmaceutical therapeutically active compounds and derivatives thereof are 
30 typically formulated and administered in unit-dosage forms or multiple-dosage 

forms. Unit-dose forms as used herein refers to physically discrete units suitable 
for human and animal subjects and packaged individually as is known in the art. 
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Each unit-dose contains a predetermined quantity of the therapeutically active 
compound sufficient to produce the desired therapeutic effect, in association 
with the required pharmaceutical^ acceptable carrier, vehicle or diluent. 
Examples of unit-dose forms include ampoules and syringes and individually 
5 packaged tablets or capsules. Unit-dose forms may be administered in fractions 
or multiples thereof. A multiple-dose form is a plurality of identical unit-dosage 
forms packaged in a single container to be administered in segregated unit-dose 
form. Examples of multiple-dose forms include vials, bottles of tablets or 
capsules or bottles of pints or gallons. Hence, multiple dose form is a multiple of 
10 unit-doses which are not segregated in packaging. 

The composition can contain along with the active ingredient: a diluent 
such as lactose, sucrose, dicalcium phosphate, or carboxymethylcellulose; a 
lubricant, such as magnesium stearate, calcium stearate and talc; and a binder 
such as starch, natural gums, such as gum acaciagelatin, glucose, molasses, 
15 polivinylpyrrolidine, celluloses and derivatives thereof, povidone, crospovidones 
and other such binders known to those of skill in the art. Liquid 
pharmaceutical^ administrate compositions can, for example, be prepared by 
dissolving, dispersing, or otherwise mixing an active compound as defined above 
and optional pharmaceutical adjuvants in a carrier, such as, for example, water, 
20 saline, aqueous dextrose, glycerol, glycols, ethanol, and the like, to thereby form 
a solution or suspension. If desired, the pharmaceutical composition to be 
administered may also contain minor amounts of nontoxic auxiliary substances 
such as wetting agents, emulsifying agents, or solubilizing agents, pH buffering 
agents and the like, for example, acetate, sodium citrate, cyclodextrine 
25 derivatives, sorbitan monolaurate, triethanolamine sodium acetate. 

triethanolamine oleate, and other such agents. Methods of preparing such 
dosage forms are known, or will be apparent, to those skilled in this art (see, 
e.g., Remington's Pharmaceutical Sciences, Mack Publishing Company, Easton, 
Pa., 15th Edition, 1975). The composition or formulation to be administered will 
30 contain a quantity of the active compound in an amount sufficient to alleviate 
the symptoms of the treated subject. 
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Dosage forms or compositions containing active ingredient in the range of 
0.005% to 100% with the balance made up from non-toxic carrier may be 
prepared. For oral administration, the pharmaceutical compositions may take the 
form of, for example, tablets or capsules prepared by conventional means with 
5 pharmaceutical^ acceptable excipients such as binding agents (e.g., 
pregelatinized maize starch, polyvinyl pyrrolidone or hydroxypropyl 
methylcellulose); fillers (e.g.. lactose, macrocrystalline cellulose or calcium 
hydrogen phosphate); lubricants (e.g., magnesium stearate, talc or silica); 
disintegrants (e.g., potato starch or sodium starch g.ycolate); or wetting agents 
10 (e.g., sodium lauryl sulphate). The tablets may be coated by methods well- 
known in the art. 

The pharmaceutical preparation may also be in liquid form, for example, 
solutions, syrups or suspensions, or may be presented as a drug product for 
reconstitution with water or other suitable vehicle before use. Such liquid 

15 preparations may be prepared by conventional means with pharmaceutical^ 
acceptable additives such as suspending agents (e.g.. sorbitol syrup, cellulose 
derivatives or hydrogenated edible fats); emulsifying agents (e.g., lecithin or 
acacia); non-aqueous vehicles (e.g., almond oil, oily esters, or fractionated 
vegetable oils); and preservatives (e.g., methyl or propyl-p-hydroxybenzoates or 

20 sorbic acid). 

Formulations suitable for rectal administration are preferably presented as 
unit dose suppositories. These may be prepared by admixing the active 
compound with one or more conventional solid carriers, for example, cocoa 
butter, and then shaping the resulting mixture. 

25 Formulations suitable for topical application to the skin or to the eye 

preferably take the form of an ointment, cream, lotion, paste, gel, spray, aerosol 
and oil. Carriers which may be used include vaseline, lanoline, polyethylene 
glycols, alcohols, and combinations of two or more thereof. The topical 
formulations may further advantageously contain 0.05 to 15 percent by weight 

30 of thickeners selected from among hydroxypropyl methyl cellulose, methyl 
cellulose, polyvinylpyrrolidone, polyvinyl alcohol, poly (alkylene glycols), 
poly/hydroxyalkyl, (meth)acrylates or P oly(meth)acrylamides. A topical 
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formulation is often applied by instillation or as an ointment into the conjunctival 
sac. It can also be used for irrigation or lubrication of the eye, facial sinuses, 
and external auditory meatus. It may also be injected into the anterior eye 
chamber and other places. The topical formulations in the liquid state may be 
5 also present in a hydrophilic three-dimensional polymer matrix in the form of a 
strip, contact lens, and the like from which the active components are released. 

For administration by inhalation, the compounds for use herein can be 
delivered in the form of an aerosol spray presentation from pressurized packs or 
a nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 
10 trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other 
suitable gas. In the case of a pressurized aerosol, the dosage unit may be 
determined by providing a valve to deliver a metered amount. Capsules and 
cartridges of, e.g., gelatin, for use in an inhaler or insufflator may be formulated 
containing a powder mix of the compound and a suitable powder base such as 

1 5 lactose or starch. 

Formulations suitable for buccal (sublingual) administration include, for 
example, lozenges containing the active compound in a flavored base, usually 
sucrose and acacia or tragacanth; and pastilles containing the compound in an 
inert base such as gelatin and glycerin or sucrose and acacia. 
20 The compounds may be formulated for parenteral administration by 

injection, e.g., by bolus injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose 
containers, with an added preservative. The compositions may be suspensions, 
solutions or emulsions in oily or aqueous vehicles, and may contain formulatory 
25 agents such as suspending, stabilizing and/or dispersing agents. Alternatively, 
the active ingredient may be in powder form for ^constitution with a suitable 
vehicle, e.g., sterile pyrogen-free water or other solvents, before use. 

Formulations suitable for transdermal administration may be presented as 
discrete patches adapted to remain in intimate contact with the epidermis of the 
30 recipient for a prolonged period of time. Such patches suitably contain the 

active compound as an optionally buffered aqueous solution of, for example, 0.1 
to 0.2 M concentration with respect to the active compound. Formulations 
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suitable for transdermal administration may also be delivered by iontophoresis 
[see, e.g.. Pharmaceutical Research 3 (6), 318 (1986)) and typically take the 
form of an optionally buffered aqueous solution of the active compound. 

The pharmaceutical compositions may also be administered by controlled 
5 release means and/or delivery devices (see, e.g., in U.S. Patent Nos. 3,536,809; 
3,598,123; 3,630,200; 3,845,770; 3,847,770; 3,916,899; 4,008,719; 
4,687,610; 4,769,027; 5,059,595; 5,073,543; 5,120,548; 5,354,566; 
5,591,767; 5,639,476; 5,674,533 and 5,733,566). 

Desirable blood levels may be maintained by a continuous infusion of the 
10 active agent as ascertained by plasma levels. It should be noted that the 

attending physician would know how to and when to terminate, interrupt or 
adjust therapy to lower dosage due to toxicity, or bone marrow, liver or kidney 
dysfunctions. Conversely, the attending physician would also know how to and 
when to adjust treatment to higher levels if the clinical response is not adequate 

15 (precluding toxic side effects). 

The efficacy and/or toxicity of the pharmaceutical compositions provided 
herein, alone or in combination with other agents can also be assessed by the 
methods known in the art (See generally, O'Reilly, Investigational New Drugs, 
15:5-13(1997)). 

20 The active compounds or pharmaceutical^ acceptable derivatives may be 

prepared with carriers that protect the compound against rapid elimination from 
the body, such as time release formulations or coatings. 

Kits containing the compositions and/or the combinations with 
instructions for administration thereof are provided. The kit may further include 
25 a needle or syringe, preferably packaged in sterile form, for injecting the 

complex, and/or a packaged alcohol pad. Instructions are optionally included for 
administration of the active agent by a clinician or by the patient. 

Finally, the pharmaceutical compositions provided herein containing any 
of the preceding agents may be packaged as articles of manufacture containing 
30 packaging material, a compound or suitable derivative thereof provided herein, 
which is effective for treatment of a diseases or disorders contemplated herein, 
within the packaging material, and a label that indicates that the compound or a 
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suitable derivative thereof is for treating the diseases or disorders contemplated 
herein. The label can optionally include the disorders for which the therapy is 
warranted. 

Also provided herein are personal care formulations containing oil bodies 
5 comprising a thioredoxin/thioredoxin-reductase fusion polypeptide. Personal care 
products comprising thioredoxin and thioredoxin-reductase are disclosed in for 
example Japanese Patent Applications JP9012471A2, JP103743A2, and 
JP1 129785 A2 Personal care formulations that may be prepared in accordance 
with the present invention include formulations capable of improving the physical 
10 appearance of skin exposed to detrimental environmental stimuli resulting in 
oxidative stress for example oxidative stress caused by UV-generated free- 
radicals. The oil bodies comprising thioredoxin/thioredoxin-reductase may also 
be used to prepare hair care products as described in US Patent Nos. 4,935,231 
and 4,973,475 (incorporated herein by reference in their entirety). 
15 The following examples are included for illustrative purposes only and are 

not intended to limit the scope of the invention. 

EXAMPLE 1 

u»| a *inn nf thioredoxin and NADP H thioredoxin-reductase genes 

An Arabidopsis silique cDNA library CD4-1 2 was obtained from the 
20 Arabidopsis Biological Resource Centre (ABRC, http://aims.cps.msu.edu) 
Arabidopsis stock centre and used as a template for the isolation of the 
thioredoxin h (Trxh) and thioredoxin-reductase genes from Arabidopsis. For the 
isolation of the Trxh gene the following primers were synthesized: 
GVR833: 5' TA CCATGG CTTCGGAAGAAGGA 3' (SEQ ID NO:1) 
25 The sequence identical to the 5' end of the Trxh gene as published in 

Rivera-Madrid et al, (1993) Plant Physiol 102: 327-328, is indicated in bold. 
Underlined is an Ncol restriction site to facilitate cloning. GVR834: 5' 
G A A AGCTT A AG CC AAGTGTTTG 3' (SEQ ID NO:2) 

The sequence complementary to the 3' end of the Trxh gene as published 
30 in Rivera-Madrid et al, (1993) Plant Physiol 102: 327-328, is indicated in bold. 
Underlined is an Hindlll restriction site to facilitate cloning. 
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A Polymerase Chain Reaction (PCR) was carried out using QVR833 and 
GVR834 as primers and the cDNA library CD4-12 as a template. The resulted 
PCR fragment was isolated, cloned into pBluescript and sequenced. The isolated 
sequence encoding Trxh was identical to the published Trxh gene sequence 
5 (Rivera-Madrid et al, (1993) Plant Physiol 102: 327-328). The pBluescript vector 
containing the Trxh gene is called pSBS2500. 

For the isolation of the thioredoxin-reductase gene the following primers 

were synthesized; 

GVR836: 5' GGCCAGCACACTACCATGAATGGTCTCGAAACTCAC 3' (SEQ ID 
10 NO-3) The sequence identical to the 5' end of the thioredoxin-reductase gene 
as published (Jacquot et al, J Mo. Biol. (1994) 235 (41:1357-63), is indicated in 
bold). 

GVR837: 5' TTAAGCTTCAATCACTCTTACCTTGCTG 3' (SEQ ID NO:4). 

A Polymerase Chain Reaction (PCR) was carried out using GVR836 and 
1 5 GVR837 as primers and the cDNA library CD4-1 2 as a template. The resulted 
PCR fragment was isolated, cloned into pBluescript and sequenced. The 
pBluescript vector containing the thioredoxin-reductase gene is called 
pSBS2502. 

A total of three clones were sequenced, the sequence of each of the 
three clones were identical to each other. However, as depicted in Figure 1 this 
sequence indicated several nucleotide differences compared to the published 
thioredoxin-reductase gene sequence published (Jacquot et al, J Mol Biol. (1994) 
235 (4):1 357-63.). The complete coding sequence and its deduced amino ac.d 
sequence is shown in SEQ ID NO:10. As a result of the nucleotide differences 
25 between the published sequence and the sequence isolated in Example 1 , 

several amino acid changes are also predicted. A comparison of the deduced 
amino acid sequence of the published NADPH thioredoxin-reductase sequence 
thioredoxin-reductase (ATTHIREDB, Jacquot et al, J Mol Biol. (1994) 235 
(41:1357-63.) with the sequence isolated in Example 1 (TR) is shown in Figure 3. 

30 
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EXAMPLE 2 

Construction of plant expr «sslon vectors. 

Expression vectors were constructed to allow for the seed specific over- 
expression of thioredoxin and NADPH thioredoxin-reductase in seeds. Vectors 
5 were constructed to allow for over-expression in its natural subcellular location 
and for accumulation on oil bodies. 

Construction of plant transformation vector pSBS2520. 
The Arabidopsis thioredoxin h gene as described in example 1 was placed under 
the regulatory control of the phaseolin promoter and the phaseolin terminator 
10 derived from the common bean Phaseolus vulgaris (Slightom et al (1 983) Proc. 
Natl Acad Sc USA 80: 1 897-1901 ; Sengupta-Gopalan et al., (1985) PNAS USA 
82: 3320-3324)). A gene splicing by overlap extension technique {Horton et al 

(1989) 1 5: 61-68) was used to fuse the phaseolin promoter to the Trxh gene. 
Standard molecular biology laboratory techniques (see eg: Sambrook et al. , 

(1990) Molecular Cloning, 2 nd ed. Cold Spring Harbor Press) were used to furnish 
the phaseolin promoter and terminator with Pst I and Hindlll/Kpnl sites 
respectively (see SEQ ID NO:14). Standard molecular biology laboratory 
techniques were also used to place the phaseolin terminator downstream from 
the Trxh gene. The Pstl-phaseolin promoter- Trxh-phaseolin terminator-Kpnl 
insert sequence was cloned into the Pstl-Kpnl sites of pSBS3000 (pSBSSOOO is 
a derivative from the Agrobacterium binary plasmid pPZP221 (Hajdukiewicz et 
al., 1994, Plant Molec. Biol. 25: 989-994). In P SBS3000, the CaMV35S 
promoter-gentamycin resistance gene-CAMV 35S terminator of pPZP221 was 
replaced with parsley ubiquitin promoter-phosphinothricin acetyl transferase 

26 gene-parsley ubiquitin termination sequence to confer resistance to the herbicide 
glufosinate ammonium.) The resulting plasmid is called pSBS2520. The 
sequence of the phaseolin promoter-^oWops/s Trxh-phaseolin terminator 

sequence is shown in SEQ ID NO:14. 

Construction of plant transformation vector pSBS2510. 
The 3' coding sequence of an Arabidopsis oleosin gene (van Rooijen et al (1992) 
Plant Mol. Biol. 18: 1 177-1179) was altered to contain an Ncol site. The Ncol- 
Hindlll fragment from vector pSBS2500 (Example 1) containing the Trxh was 
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ligated to the coding sequence of this Arabidopsis oleosin utilizing this Ncol 
restriction site. A gene splicing by overlap extension technique (Horton et al 

(1989) 15: 61-68) was used to fuse the phaseolin promoter (Slightom et al 
(1 983) Proc. Natl Acad Sc USA 80: 1 897-1 901 ; Sengupta-Gopalan et al., 

5 (1985) PNAS USA 82: 3320-3324) containing a synthetic Pstl site (see 

construction of pSBS2520) to the coding sequence of the Arabidopsis oleosin. 
Standard molecular biology laboratory techniques (see eg: Sambrook et al. 

(1990) Molecular Cloning, 2"" ed. Cold Spring Harbor Press) were again used to 
clone the Hindlll Kpnl fragment containing the phaseolin terminator (see 

10 construction of P SBS2520) downstream of the Trxh gene. The Pstl-phaseolin 
promoter- oleosin- Trxh-phaseolin terminator-Kpnl insert sequence was cloned 
into the Pstl-Kpnl sites of pSBS3000. The resulting plasmid is called pSBS2510. 
The sequence of the phaseolin promoter-oleosin Trxh-phaseolin terminator 
sequence is shown in SEQ ID NO-.16. 
1 5 Construction of plant transformation vector pSBS252 1. 

This vector contains the same genetic elements as the insert of pSBS2510 
except the Trxh gene is fused to the 5' end of the oleosin gene.The 3' oleosin 
coding sequence including its native stopcodon (van Rooijen et al (1 992) Plant 
Mol. Biol.18: 1 177-1179) was furnished with a Hindlll cloning site. Again a 
20 gene splicing by overlap extension technique (Horton et al (1989) 15: 61-68) 
was used to fuse the phaseolin promoter to the Trxh gene and to fuse the Trxh 
gene to the oleosin sequence. Standard molecular biology laboratory techniques 
(see eg: Sambrook et al. (1990) Molecular Cloning, 2" d ed. Cold Spring Harbor 
Press) were again used to clone the Hindlll Kpnl fragment containing the 
25 phaseolin terminator (see construction of pSBS2520) downstream of the oleosin 
gene. The Pstl-phaseolin promoter- Trxh oleosin- phaseolin terminator-Kpnl 
insert sequence was cloned into the Pstl-Kpnl sites of P SBS3000. The resulting 
plasmid is called P SBS2521. The sequence of the phaseolin promoter- Trxh 
oleosin -phaseolin terminator sequence is shown in SEQ ID NO:19. 
30 Construction of plant transformation vector pSBS2527. 

The Arabidopsis NADPH thioredoxin-reductase gene as described in example 1 
was placed under the regulatory control of the phaseolin promoter and the 
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phaseolin terminator derived from the common bean Phaseolus vulgaris 
(Slightom et al (1983) Proc. Natl Acad Sc USA 80: 1897-1901 ; Sengupta- 
Gopalan et al.. (1985) PNAS USA 82: 3320-3324). A gene splicing by overlap 
extension technique (Horton et al (1989) 15: 61-68) was used to fuse the 
S phaseolin promoter to the thioredoxin-reductase gene. Standard molecular 

biology laboratory techniques (see eg: Sambrook et al. (1990) Molecular Cloning, 
2 nd ed. Cold Spring Harbor Press) were used to furnish the phaseolin promoter 
and terminator with Pstl and Hindlll/Kpnl sites respectively (see SEQ ID N0:14). 
Standard molecular biology laboratory techniques were also used to place the 
10 phaseolin terminator downstream from the thioredoxin-reductase gene. The Pstl- 
phaseolin promoter-thioredoxin-reductase-phaseolin terminator-Kpnl insert 
sequence was cloned into the Pstl-Kpnl sites of P SBS3000 The resulting plasmid 
is called pSBS2527. The sequence of the phaseolin promoter-ZlraM/ops/s 
thioredoxin-reductase-phaseolin terminator sequence is shown in SEQ ID NO.-22. 
1 5 Construction of plant transformation vector pSBS2S31 . 

A gene splicing by overlap extension technique (Horton et al (1989) 15: 61-68) 
was used to fuse the phaseolin promoter (Slightom et al (1 983) Proc. Natl Acad 
Sc USA 80: 1 897-1 901 ; Sengupta-Gopalan et al., (1985) PNAS USA 82: 3320- 
3324) to the coding sequence of the Arabidopsis oleosin. The same gene 
20 splicing technique was used to fuse the oleosin gene to the thioredoxin- 
reductase coding sequence. Standard molecular biology laboratory techniques 
(see eg: Sambrook et al. (1990) Molecular Cloning, 2 nd ed. Cold Spring Harbor 
Press) were again used to clone the Hindlll Kpnl fragment containing the 
phaseolin downstream of the thioredoxin-reductase gene. The Pstl-phaseolin 
25 promoter- oleosin- thioredoxin-reductase -phaseolin terminator-Kpnl insert 
sequence was cloned into the Pstl-Kpnl sites of pSBS3000. The resulting 
plasmid is called pSBS2531. The sequence of the phaseolin promoter-oleosin 
thioredoxin-reductase -phaseolin terminator sequence is shown in SEQ ID N0:24. 
Construction of plant transformation vector pSBS2529 
30 This vector contains the same genetic elements as the insert of pSBS2531 

except the thioredoxin-reductase gene is fused to the 5' end of the oleosin gene. 
The 3" oleosin coding sequence including its native stopcodon (van Rooijen et al. 
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(1992) Plant Mol. Biol. 18: 1 177-1 179) was furnished with a Hindlll cloning site. 
Again a gene splicing by overlap extension technique (Horton et al (1989) 15: 
61-68) was used to fuse the phaseolin promoter to the thioredoxin-reductase 
gene and to fuse the thioredoxin-reductase gene to the oleosin sequence. 
5 Standard molecular biology laboratory techniques (see eg: Sambrook et al. 

(1990) Molecular Cloning, 2 nd ed. Cold Spring Harbor Press) were again used to 
clone the Hindlll Kpnl fragment containing the phaseolin terminator (see 
construction of pSBS2520) downstream of the oleosin gene. The Pstl-phaseolin 
promoter- thioredoxin-reductase oleosin- phaseolin terminator-Kpnl insert 
10 sequence was cloned into the Pstl-Kpnl sites of pSBS30OO. The resulting 
plasmid is called pSBS2529. The sequence of the phaseolin promoter- 
thioredoxin-reductase oleosin -phaseolin terminator sequence is shown in SEQ ID 

NO:27. 

Construction of plant transformation vector pSBS2530. 

1 5 A plant transformation was constructed containing the Mycobacterium Leprae 
thioredoxin-reductase /thioredoxin gene [Mlep TR/Trxh). A construct called 
pHIS/TR/Trxh (Wieles et al (1995) J Biol Chem 270:25604-25606) was obtained 
from the department of Immunohematology and Blood bank, Leiden University, 
The Netherlands and use as a template for PCR to generate pSBS2530. The 

20 construction of pSBS2530 was identical to the construction of P SBS2531 
except that the Mlep TR/Trxh gene was used instead of the Arabidopsis 
thioredoxin-reductase gene. A gene splicing by overlap extension technique 
(Horton et al (1989) 15: 61-68) was used to fuse the phaseolin promoter 
(Sllghtom et al (1983) Proc. Natl Acad Sc USA 80: 1897-1901; Sengupta- 

25 Gopalan et al., (1985) PNAS USA 82: 3320-3324) to the coding sequence of 

the Arabidopsis oleosin. The same gene splicing technique was used to fuse the 
oleosin gene to the Mlep TR/Trxh coding sequence. Standard molecular biology 
laboratory techniques (see eg: Sambrook et al. (1990) Molecular Cloning, 2"< ed. 
Cold Spring Harbor Press) were again used to clone the Hindlll-Kpnl fragment 
30 containing the phaseolin downstream of the Mlep TR/Trxh gene. The Pstl- 
phaseolin promoter- oleosin- Mlep TR/Trxh -phaseolin terminator-Kpnl insert 
sequence was cloned into the Pstl-Kpnl sites of P SBS3000. The resulting 

RECTIFIED SHEET (RULE 91) 



WO 02/50289 



PCTVUS01/50240 



-85- 



plasmid is called P SBS2530. The sequence of the phaseolin promoter-oleosin 
Mlep TR/Trxh -phaseolin terminator sequence is shown in SEQ ID NO:30. 

Construction of plant transformation vector pSBS2542. 
From initial activity assays (Figure 4), it was apparent that oil bodies expressing 
5 the oleosin-/W. lep TR/Trxh fusion protein contained considerable reducing 

activity. It was anticipated that a similar oleosin fusion construct encoding the 
Arabidopsis thioredoxin-reductase and thioredoxin proteins would behave in an 
analogous manner. Molecular modeling was used to aid in the design of such a 
construct. Primers were designed (thioredoxin link-L: 5'- 
10 ACTGGAGATGTTGACTCGACGGATACTACGGATTGGTCGACGG 

CTATGGAAGAAGGACAAGTGATCGCCTGC-3'; (SEQ ID NO:5), and thioredoxin 

link-R: 

5'-ATCCGTCGAGTCAACATCTCCAGTTTCCTCGGTGGTCTCGTTAGCCTTCGAT 
CCAGCAATCTCTTGTAAGAATGCTCTGC-3'; (SEQ ID NO:6) to code for a 
15 synthetic linker peptide between the thioredoxin-reductase and thioredoxin 
proteins. These primers were used in conjunction with primers GVR 873 (5'- 
GTGGAAGCT T ATGG AG ATGG AG-3' ; SEQ ID NO:7) and GVR834 (5'- 
GAAAGCTTAAGCCAAGTGTTTG-3'; SEQ ID NO:2) to amplify a region coding 
for a thioredoxin-reductase-linker region-thioredoxin utilizing a gene splicing by 
20 overlap extension technique (Horton et al (1989) 15:61-68). The thioredoxin- 
reductase-tinker-thioredoxin encoding sequence was then cloned into a pre- 
existing pSBS3000 vector using standard molecular biology techniques 
(Sambrook et al (1990) Molecular Cloning 2"" Edition Cold Spring Harbour Press) 
The resulting plasmid was called pSBS2542. The sequence of the phaseolin 
25 promoter-oleosin-thioredoxin-reductase-linker-thioredoxin-phaseolin terminator 

region is shown in SEQ ID NO:33. An amino acid sequence comparison 
between this Arabidopsis thioredoxin-reductase-linker-thioredoxin and the M. 
leprae TR/Trxh protein is shown in Figure 1 2. 

Plasmids pSBS2510, P SBS2520. pSBS2521, pSBS2527. pSBS2529, 
30 pSBS2530, P SBS2531 and pSBS254.2 were electroporated into Agrobacterium 
strain EHA101 . These Agrobacterium strains were used to transform 
Arabidopsis. Arabidopsis transformation was done essentially as described in 
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"Arabidopsfs Protocols; Methods in molecular biology Vol 82. Edited by 
Martinez-Zapater JM and Salinas J. ISBN 0-89603-391-0 pg 259-266 (1998) 
except the putative transgenic plants were selected on agarose plates containing 
80/yM L-phosphinothricine, after they were transplanted to soil and allowed to 
5 set seed. 

EXAMPLE 3 

Polvacrvlamide qelelectrophoresls and lmmunoblot tlnq of tr ansgenic , seed 
extracts. 

10 Source of Arabidopsis thioredoxin, thioredoxin-reductase and oleosin 

antibodies. 

The Arabidopsis thioredoxin and thioredoxin-reductase genes were cloned in 
frame in bacterial expression vector pRSETB (Invitrogen) to allow for the 
overexpression of Arabidopsis thioredoxin and thioredoxin-reductase proteins. 

15 These proteins were purified using standard protocols (see eg Invitrogen 

protocol) and used to raise antibodies in rabbits using standard biochemical 
techniques (See eg Current Protocols in Molecular Biology, John Wiley & Sons, 
N.Y. (1989). The Arabidopsis oleosin gene genes was cloned in frame in 
bacterial expression vector pRSETB (Invitrogen) to allow for the overexpression 

20 Arabidopsis oleosin protein. This protein was purified using standard protocols 
(see eg Invitrogen protocol) and used to prepare mouse monoclonal antibodies 
using standard biochemical techniques (See eg Current Protocols in Molecular 
Biology, John Wiley & Sons, N.Y. (1989). 

Preparation of total Arabidopsis seed extracts for PAGE. Arabidopsis 

25 seeds were ground in approximately 20 volumes of 2% SDS, 50 mM Tris-CI,, 
this extract was boiled, spun and the supernatant was prepared for 
polyacrylamide gelelectrophoresis (PAGE) using standard protocols. 
Preparation of Arabidopsis oif-body-protein extracts. 

Arabidopsis seeds were ground in approximately 20 volumes of water and spun 
30 in a microfuge. The oil bodies were recovered and washed sequentially with 
approximately 20 volumes of water, a high stringency wash buffer, containing 
8M urea and 100 mM sodiumcarbonate and water. After this last wash the 
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oil bodies are prepared for poly acrylamide gelelectrophoresis (PAGE) using 

standard protocols. 

Analysis of seed and oil body extracts from plants transformed with 

pSBS2510 

5 Total seed and oil body protein extracts from plants transformed with 
DSBS2510 were loaded onto polyacrylamide gels and either stained with 
coomassie brilliant blue or electroblotted onto PVDF membranes. The membranes 
were challenged with a polyclonal antibody raised against Arabidopsis 
thioredoxin, or a monoclonal antibody raised against the Arabidopsis 18.5 kDa 

10 oleosin and visualized using alkaline phosphatase. Expression of the oleosin- 
thioredoxin results in an additional band of 31 .2 kDa. The results indicate that 
the thioredoxin antibodies are immunologically reactive with a band of the right 
predicted molecular weight (31.2 kDa), and the oleosin antibodies are also 
immunologically reactive with a band of the right predicted molecular weight for 

15 the fusion protein (31.2 kDa) in addition to a band corresponding to the native 
Arabidopsis oleosin (18.5 kDa). This indicates that oleosin-thioredoxin is 
expressed in Arabidopsis seeds and is correctly targeted to oil bodies. 

Analysis of seed and oil body extracts from plants transformed with 
pSBS252 1 

20 Total seed and oil body protein extracts from plants transformed with 

pSBS25121 were loaded onto polyacrylamide gels and either stained with 
Coomassie brilliant biue or electroblotted onto PVDF membranes. The 
membranes were challenged with a polyclonal antibody raised against 
Arabidopsis thioredoxin, or a monoclonal antibody raised against the Arabidopsis 

25 18.5 kDa oleosin and visualized using alkaline phosphatase. Expression of the 
thioredoxin-oleosin results in an additional band of 31 .2 kDa. The results 
indicate that the thioredoxin antibodies are immunologically reactive with a band 
of the right predicted molecular weight (31.2 kDa), and the oleosin antibodies 
are also immunologically reactive with a band of the right predicted molecular 

30 weight for the fusion protein (31 .2 kDa) in addition to a band corresponding to 
the native Arabidopsis oleosin (1 8.5 kDa). This indicates that thioredoxin- 
oleosin is expressed in Arabidopsis seeds and is correctly targeted to oil bodies. 
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Ana/ysfs of seed extracts from plants transformed with pSBS2520 Total 
seed extracts from plants transformed with pSBS2520 were loaded onto 
polyacrylamide gels and either stained with Coomassie brilliant blue or 
electroblotted onto PVDF membranes. The membranes were challenged with a 
5 polyclonal antibody raised against Arabidopsis thioredoxin and visualized using 
alkaline phosphatase. The results indicated that the thioredoxin antibodies are 
immunologically reactive with a band of approximately the right predicted 
molecular weight (12 kDa). Untransformed seeds do not show a detectable 
thioredoxin band. 

1 0 Analysis of seed and oil body extracts from plants transformed with 

pSBS2S29 

Total seed and oilbody protein extracts from plants transformed with P SBS2529 
were loaded onto polyacrylamide gels and electroblotted onto PVDF membranes. 
The membranes were challenged with a polyclonal antibody raised against 

1 5 Arabidopsis thioredoxin-reductase, or a monoclonal antibody raised against the 
Arabidopsis 18.5 kDa oleosin and visualized using alkaline phosphatase. 
Expression of the thioredoxin-reductase -oleosin results in an additional band of 
53.8 kDa. The results indicate that the thioredoxin-reductase antibodies are 
immunologically reactive with a band of the right predicted molecular weight for 

20 the fusion protein (53.8 kDa), the oleosin antibodies are also immunologically 
reactive with a band of the right predicted molecular weight (53.8 kDa) in 
addition to a band corresponding to the native Arabidopsis oleosin (18.5 kDa). 
This indicates that thioredoxin-reductase-oleosin is expressed in Arabidopsis 
seeds. 

25 Analysis of seed extracts from plants transformed with P SBS2527 Total 

seed extracts from plants transformed with P SBS2527 were loaded onto 
polyacrylamide gels and electroblotted onto PVDF membranes. The membranes 
were challenged with a polyclonal antibody raised against Arabidopsis 
thioredoxin-reductase and visualized using alkaline phosphatase. The 

30 thioredoxin-reductase antibodies are immunologically reactive with a band of 
approximately the right predicted molecular weight for the (35.3 kDa). 
Untransformed seeds do not show a detectable thioredoxin band. 
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Analysis of seed extracts from plants transformed with P SBS2531 A 
protein gel and immunoblot was prepared assaying the expression of oleosin- 
DMSR in Arabidopsis T2 seeds and correct targeting to Arabidopsis oil bodies. 
The expected molecular weight based on the deduced amino acid sequence is 
5 calculated to be 53,81 7 Da. In the oil body extract of the transgenic oleosin- 
thioredoxin-reductase sample an extra band of approximately 54 kDa was 
observed. This band was confirmed to be oleosin-thioredoxin-reductase by 
immunoblotting. From the polyacrylamide gel it was observed that the 
expression of the oleosin -Thioredoxin-reductase is about double compared to 
10 the expression of the major 18.5 KDa Arabidopsis oleosin. This represents 
approximately 2-4 % of total seed protein. 

Analysis of seed extracts from plants transformed with pSBS2530 A 
protein gel and immunoblot was prepared assaying the expression of oleosin- 
M.lep TR/Trxh in Arabidopsis 12 seeds and the correct targeting to Arabidops.s 
1 5 oil bodies. The expected molecular weight based on the deduced amino ac.d 
sequence is calculated to be 67,550 Da. In the oil body extract of the 
transgenic oleosin-M/ep TR/Trxh sample an extra band of approximately 68 kDa 
was observed. This band was confirmed to be oleosin-M/ep TR/Trxh by 
immunoblotting. From the polyacrylamide gel it was observed that the 
20 expression of the oleosin-/W./ep TR/Trxh is similar to the expression of the major 
18.5 kDa Arabidopsis oleosin. This represents approximately 1-2 % of total 
seed protein. 

Analysis of seed extracts from plants transformed with P SBS2542 Crude 
oil body extracts from pSBS2542 lines were prepared by grinding 100//g of seed 

25 in 1 mL of 100mM Tris buffer at P H 7.5. The samples were then centrifuged in 
order to isolate the oil body fraction. The oil body fraction was then loaded on 
an SDS polyacrylamide gel for expression analysis. A Coomassie stained gel 
revealed that the synthetic fusion accumulated to high levels in crude oil body 
extracts from 3 of the 4 lines tested. It was estimated that the fusion protein 

30 represented approximately 2-5% of total seed protein. Furthermore, western 
blots utilizing either anti-thioredoxin or anti-thioredoxin-reductase antibod.es 
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confirmed that the over expressed 70 kDa protein was indeed oleosin- 
thioredoxin-reductase-linker-thioredoxin. 

EXAMPLE 4 

5 Rtnloalcai activity "* thioredoxin and thiore doxln-red, .nta.se transformers 

Initial reduction assays: 
DTNB assay 

The activity of the thioredoxin and thioredoxin reductase was determined 
using a colorimetric DTNB l5,5'-dithiolbis (2-nitrobenzoic acid)] assay. The 

10 assay was performed in a 700 //L reaction volume containing 1 0OmM Tris-CI pH 
8.0, 5 mM EDTA, 200//M DTNB [5,5'-dithiolbis (2-nitrobenzoic acid)] and ZOOuU 
NADPH. If thioredoxin-reductase and thioredoxin are added, NADPH will reduce 
the thioredoxin-reductase, which will then reduce thioredoxin, which will, in 
turn, reduce DTNB (see equations below). 

15 NADPH, + thioredoxin-reductase,, --> thioredoxin-reductase„„ + NADP* 

thioredoxin-reductase red + thioredoxin ox > thioredoxin,,, + thioredoxin- 

reductase ox 

thioredoxin red + DTNB ox > 2(2-nitro-5-mercaptobenzoic acid) + 

thioredoxin ox 

20 The formation of the yellow product was monitored by measuring the 

OD 4t2 in a spectrophotometer after a set period of time (usually 0.5-2 hours). 
The results of initial activity assays are shown in the bar graph in Figure 4 and 
described below. 

Initially, 100 //g of total seed proteins were added from each of the 
25 Arabidopsis transgenic lines, pSBS2520 (cytosolic thioredoxin) and P SBS2527 
(cytosolic thioredoxin-reductase), which corresponds to approximately 1 U9 of 
cytosolic thioredoxin and thioredoxin-reductase used in the assay. In this case, 
the amount of DTNB reduced was comparable to the reduction caused by 1 /yg 
each of £ coif thioredoxin and thioredoxin-reductase. In these plant seed 
30 samples, background readings were very low when only one of the 2 extracts 
(either cytosolic thioredoxin or cytosolic thioredoxin-reductase; Figure 4, bars 3 
and 6, respectively) was added to the reaction, along with wild type oil bodies. 
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Analysis with oil body fractions from transgenic seeds revealed that 
Arabidopsis thioredoxin and thioredoxin-reductase were substantially less active 
when fused to oleosins on oil bodies. Approximately 300 p 9 of crude, 
unwashed oil-body-protein was used in the assay (which corresponds to 10- 
5 30//g of thioredoxin-oleosin (pSBS 2521 ; Figure 4, bar 2), oleosin-thioradoxin 
[pSBS 2510, Figure 4, bar 1), thioredoxin-reductase-oleosin (pSBS 2529. Figure 
4, bar 5), or oleosin-thioredoxin-reductase (pSBS 2531, Figure 4, bar 4). The oil- 
body-proteins were tested in conjunction with 100^g of total seed protein 
containing approximately 1/ig of cytosolic thioredoxin (pSBS 2520) or 

10 thioredoxin-reductase (pSBS 2527). 

In such assays, pSBS2529 (thioredoxin-reductase-oleosin) and P SBS2531 
(oleosin-thioredoxin-reductase) do contain reductase activity when combined 
with cytosolic thioredoxin from pSBS2520 (see Figure 4, bars 7 and 8, 
respectively). Experiments estimated that the reductase activity of oleosin- 
1 5 thioredoxin-reductase was about 1 0-15% that of the cytosolic thioredoxin- 
reductase. The addition of tween at a final concentration of 0.4% could 
enhance this activity 2 or 3 fold. Interestingly, oleosin-thioredoxin-reductase 
(pSBS 2531) appears to be capable of reducing DTNB in the absence of added 
thioredoxin, although added thioredoxin causes significantly more DTNB 
20 reduction (see Figure 4; compare bar 4 W.T. + oleosin-thioredoxin-reductase to 
bar 7 thioredoxin + oleosin-thioredoxin-reductase). Experiments with pSBS2521 
(thioredoxin-oleosin) or pSBS2510 (oleosin-thioredoxin) combined with cytosolic 
thioredoxin-reductase from P SBS2527 (see Figure 4, bars 10 and 1 1, 
respectively) indicate that thioredoxin activity of these fusions is undetectable at 

25 these concentrations. 

Oil bodies from the transgenic Arabidopsis line, pSBS2530 (oleosin-M./ep 
TR/Trxh) contain significant thioredoxin/thioredoxin-reductase activity (see Figure 
4, bar 12). One hundred micrograms of crude oil body protein for pSBS2530 
was tested (corresponding to approximately 5//g of oleosin- M.Iep TR/trxh 
fusion) in the assay. Based on the assay, it was estimated that this fusion is 
about 25-40% as active as cytosolic Arabidopsis thioredoxin and thioredoxin- 
reductase (Figure 4, bar 9) when comparing specific activity. 

RECTIFIED SHEET (RULE 91) 



30 



WO 02/50289 



PCT/US01/50240 



-92- 



tnsulin reduction assay 

The results from the DTNB assays were confirmed with insulin reduction 
assays. This assay contained insulin at a final concentration of 1mg/mL in 
100mM KH 2 P0 4 pH 7.0 + 5 mM EDTA. In the presence of NADPH (500//M). 
5 thioredoxin, and thioredoxin-reductase, insulin is reduced and precipitates from 
the solution. Normally, insulin reduction is followed by measuring turbidity at 
OD 650. Alternatively, one can measure the conversion of NADPH 2 to NADP + 
by monitoring the decrease in absorbance at 340 nm. 

Both of the assays are difficult to measure when oil bodies are present, 
10 due to interference with the spectrophotometer readings. However, qualitative 
. data could be obtained by centrifuging the tubes after a set period of time, and 
determining if an insulin pellet was present (oil bodies float to the top, while the 
insulin precipitate pellets out). Alternatively, samples could be filtered after a set 
period of time, and the change in absorbance at 340 nm could be measured. As 
15 mentioned previously, the results of the insulin reduction assays agreed with 
those of the DTNB assay, with the exception of the observation that P SBS2531 
(oleosin-thioredoxin-reductase) only reduced insulin in the presence of free 

thioredoxin from pSBS2520. 

Assays on seeds from Arabidopsis crosses that co-express oleosin- 
20 thioredoxin and oleosin-thioredoxin-reductase. 

Based upon initial DTNB and insulin reduction assays, it was apparent that 
mixing oil bodies from oleosin<-> thioredoxin and oleosin<-> thioredoxin- 
reductase transgenic seeds resulted in very limited reducing activity (Note: the 
<-> indicates both configurations of oleosin fusions; ie. oleosin<->thioredoxin 
25 would represent oleosin-thioredoxin and thioredoxin-oleosin fusions). 

To determine whether having oleosin <-> thioredoxin and oleosin <- 
> thioredoxin-reductase proteins present on the same oil body would have a 
positive effect on the reducing activity of these proteins, crosses were set up to 
generate double transgenic Arabidopsis lines. The crosses are illustrated in 
30 Table 2. 
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TABLE 2 



10 



15 



20 



25 



30 



Male 




Female 


Confirmed double 
transgenic lines 
(PCR and Western 
Blot) 


oleo-thioredoxin 


X 


oleo-thioredoxin-reductase 


4 


. oleo-thioredoxin 


X 


thioredoxin-reductase-oleo 


1 


thioredoxin-oleo 


X 


oleo-thioredoxin-reductase 


0 


thioredoxin-oleo 


X 


thioredoxin-reductase-oleo 


■ 4 


oleo-thioredoxin- 
reductase 


X 


oleo-thioredoxin 


' 2 


oleo-thioredoxin- 
reductase 


X 


thioredoxin-oleo 


0 


thioredoxin- 
reductase-oleo 


X 


oleo-thioredoxin 


7 


thioredoxin- 
reductase-oleo 


X 


thioredoxin-oleo 


0 



seedlings were transferred to soil after approximately 2 weeks. PCR 
experiments on DNA isolated from the seedlings identified a number of plants 
which contain both an oleosin <-> thioredoxin and an oleosin <->thioredoxin- 
reductase gene construct within their genome. 

Seeds were harvested from these plants for expression and activity 
assays. Western blots were carried out to confirm expression of both oleosin <- 
>thioredoxin and oleosin <->thioredoxin-reductase in the lines. DTNB and 
insulin reduction assays were also performed to compare activity between single 
transgenic parent lines and the double transgenic offspring and results are 
summarized in Table 3. Table 3 summarizes DTNB reducing activity of various 
transgenic lines. The last 2 rows compare mixing oil bodies from single 
transgenic parent lines to using oil bodies from double transgenic offspring. 
Relative activity for the E coli thioredoxin and thioredoxin mixture is set at 1 00 
percent. 
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TABLE 3 



10 



15 



20 




25 



30 



Arabidopsis "free" thioredoxin + 
thioredoxin-reductase 
( P SBS2520 4- pSBS2527) 




Based on DTNB and insulin reduction assays, it is evident that double 
transgenic plants co-expressing oleosin <-> thioredoxin and oleosin<- 
> thioredoxin-reductase on the same, single oil body contained significantly more 
reducing activity compared to mixing oil bodies from single transgenic oleosin <- 
>thioredoxin and oleosin <-> thioredoxin-reductase lines. It was additionally 
apparent that oil body extracts from co-expressing lines contained more reducing 
activity compared to line pSBS2530 (oleosin-M. lep TRHTxh), which was 
previously identified as the line containing the highest reducing activity from oil 
bodies. 

These results suggest that the creation of double transgenic lines (either 
through crossing or by co-transforming 2 expression constructs into plants) may 
represent one means by which we could solve our initial problem of not being 
able to generate reducing activity by mixing oil bodies from oleosin <- 
>thioredoxin and oleosin<->thioredoxin-reductase single transgenic lines. 

Assays on seeds from Arabidopsis P SBS2542 transgenic fines that 
express oleosin-thioredoxin-reductase-linker-thioredoxin. 

Oil body extracts from four P SBS2542 lines were tested for reducing activity in 
DTNB and insulin reduction assays, using standard protocols described 
previously. Again, oil body extracts containing the oleosin-thioredoxin- 
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10 



15 



20 



reductase-linker-thioredoxin protein possessed significant reducing activity. 
Based on such assays, it was revealed that the oleosin-thioredoxin-reductase- 
Hnker-thioredoxin synthetic fusion protein was more active than the oleosin-M. 
lep TR/Trxh fusion. Furthermore, oil bodies containing the oleosin-thioredoxin- 
reductase-iinker-thioredoxin protein appeared to have more reducing activity 
compared to oil bodies from double transgenic lines that co-expressed oleosin <- 
> thioredoxin and oleosin <-> thioredoxin-reductase. The results comparing 
reducing activity for the various thioredoxin-reductase/thioredoxin constructs is 
summarized in Table 4. Table 4 summarizes DTNB reducing activity of various 
transgenic lines. The pSBS2542 line expressing oleosin-thioredoxin-reductase- 
hnker-thioredoxin contains significant reducing activity, comparable to the "free- 
forms of Arabidopsis thioredoxin and thioredoxin-reductase and the equivalent E. 
coli proteins. Relative activity for the E coli thioredoxin and thioredoxin mixture 
is set at 1 00 percent. 

TABLE 4 



25 



30 



Source Material 


Relative Activity (%) 


Eco/itrx + NTR 


100 


Arabidopsis "free" thioredoxin + thioredoxin- 
reductase 
(pSBS2520 + pSBS2527) 


100 


oleosin- M. iep TR/Trxh 
(pSBS2530) 


-30 


Oleosin < - > thioredoxin-reductase + 
oleosin <->thioredoxin 
(mixing oil bodies from single-transgenic parents) 


-3 


Oleosin < - > thioredoxin-reductase X 
oleosin < - > thioredoxin 
(various double transgenic lines) 


-50 


Oleosin-thioredoxin-reductase-Hnker-thioredoxin 

(pSBS2542) 


-75-100 
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Reduction assays comparing the utilization of NADH vs. NADPH as a 
cofaotor (electron donor) for the thioredoxin-reductase/thioredoxin system. 
DTNB and insulin reduction assays were conducted as described previously, 
except that NADH was substituted for NADPH as an electron donor in the 

5 system utilizing E. coli thioredoxin-reductase and thioredoxin. Thus, a 

comparison was conducted of the utilization of NADH versus NADPH as a 
cofactor for the E. coli thioredoxin-reductase/ thioredoxin system. For the DTNB 
assay, the reaction mixture consisted of 400 pM DTNB, 10 //g/mL E. coli 
thioredoxin, and 10pg/mL£ coli thioredoxin-reductase in 100mMTris-CI buffer 

10 pH 8.O. Either NADH or NADPH was then added to the DTNB reaction as 
follows: 

Reaction A. 200 //M NADPH (Sigma) 
Reaction B. 800 /jM NADH (Sigma) 
Reaction C. 800 /t/M NADH (Roche) 
15 Reaction D. (-) cofactor 

Reaction E. 800 /jM NADH (no TR or Trxh). 

For the insulin reduction assay, the reaction mixture consisted of 1 mg/mL 
bovine pancreatic insulin, 20 /yg/mL E. coli thioredoxin, and 20 //g/mL E. coli 
20 thioredoxin-reductase in 100mM potassium phosphate buffer at pH 7.0. Either 
NADH or NADPH was then added to the reaction as follows: 
Reaction A. 800 //M NADPH (Sigma) 
Reaction B. 800 /jM NADH (Sigma) 
Reaction C. 800 //M NADH (Roche) 
25 Reaction D. {-) cofactor 

Reaction E. 2 mM NADH (no TR or Trxh). 

The results indicate that NADH, purchased from either Sigma or Roche, could 
act as an electron donor in both the DTNB and insulin reduction assays. 
30 However, the rate of reduction was lower than the rate observed with NADPH 
as a cofactor. It was estimated that the rate of insulin reduction utilizing NADH 
as an electron donor was approximately 25-50% when compared to the 
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maximum rate using NADPH. Furthermore, it was estimated that the rate of 
DTNB reduction utilizing NADH as an electron donor was approximately 5-10% 
of the maximum rate using NADPH. Similar results were observed using the 
oleosin-thioredoxin-reductase-linker thioredoxin fusion protein on Arabidopsis oil 
5 bodies instead of the F. coli thioredoxin-reductase and thioredoxin. 

Example 5 

Production of multimeric immunoglobulin protein in plant seed cells and capture 
on oil bodies using Protein A - oleosin fusion proteins. 
10 1 _ Production of multimerir- immunogl obu lin protei n in plant seed cells 

For expression of multimeric-protein-complexes containing multimeric- 
immunoglobulin-complexes, the cDNA sequences encoding individual light and 
heavy chains can be isolated from; 1) cell lines expressing a particular antibody, 
such as clonal B cell lines, or a hybridoma cell line, or 2) may be a recombinant 
15 antibody, assembled by combining select light and heavy chain variable domains 
and available light and heavy chain constant domain sequences, respectively. 
Variable domains with specific binding properties may be isolated from screening 
populations of such sequences, usually in the form of a single-chain Fv phage 
display library. 

20 Starting from known nucleic acid sequences and a source of light and 

heavy chains, the mature polypeptide coding sequences of each chain is isolated 
with a secretion signal sequence. The signal sequence can be the native 
antibody sequence or derived from a known secreted plant sequence (e.g. a PR 
sequence from Arabidopsis or tobacco). The addition of a plant secretion signal 
25 sequence to both light and heavy chain mature coding sequences is carried out 
by standard molecular biology techniques. PCR fusion is used routinely to make 
such modifications. Secretion signal sequences are included to target the light 
and heavy immunoglobulin polypeptides for secretion from the cell and further 
assembly of the two chains into a multimeric-immunoglobulin-complex. For 
30 expression in transgenic plant seeds, an expression cassette is assembled 

comprising: 1) a regulatory promoter sequence to provide expression in plant 
seeds, 2) the secretion signal - light chain sequence, and 3) a regulatory 
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sequence to terminate transcription. A second expression cassette is assembled 
comprising: 1) a regulatory promoter sequence to provide expression in plant 
seeds, 2) the secretion signal - heavy chain sequence, and 3) a regulatory 
sequence to terminate transcription. Each of the antibody chain expression 
5 cassettes is cloned individually into an Agrobacterium plant transformation 
vector or is combined into a single transformation vector with both expression 
cassettes. In both cases, the expression cassettes are cloned into plant 
transformation vectors, between the left and right delineating border sequences, 
and adjacent to a plant selectable marker cassette. Each plant transformation 
10 vector is transformed into Agrobacterium. The resulting Agrobacterium strains 
are used to infect plant tissues. Transgenic plant material is regenerated and 
viable transgenic plants are selected. When individual transformation vectors are 
used, the transgenic plant lines that are produced, expressing either light or 
heavy chain sequences, are crossed to generate a single plant line expressing 
15 both chains in the same plant cell. When a single transformation vector, 
containing both light and heavy expression cassettes, is used, the initial 
transgenic plant line produces both light and heavy chain sequences in the same 
plant cell. 

o _ production of transoe ni* nil bodies which display Protein A for the capture of 

20 immunoglobulins 

To capture and display immunoglobulin protein on oil bodies, oil bodies 
are engineered to display an immunoglobulin binding protein. In this example, the 
well-known antibody-binding domains from Protein A are used. Based on the 
known sequence for Protein A from Staphylococcus aureus, PCR primers are 

25 designed to isolate the five consecutive Ig-binding domains from the bacterial 
Protein A sequence. Primers are designed to allow cloning of the Protein A 
sequence as either an N-terminal or C-terminal fusion to an oleosin sequence for 
targeting to oil bodies. The sequence that encodes an in-frame translational 
fusion between Protein A and oleosin is cloned into a plant expression cassette 

30 for seed-specific expression. The final cassette consists of a regulatory 

promoter sequence that provides expression in seeds, the Protein A - oleosin 
fusion sequence, and a regulatory sequence to terminate transcription. The 
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Protein A - oleosin expression cassette is cloned into a plant transformation 
vector compatible with Agrobacterium - mediated plant transformation. The 
transformation vector comprises left and right border sequences flanking the 
Protein A - oleosin expression cassette and an adjacent plant selectable marker 
6 cassette. The Agrobacterium strain containing this vector is used to infect plant 
tissues and subsequent regeneration and selection from transgenic plant material 

to create transgenic plants. 

- Ca eturg and disD^v of multi m pric-immunoqlobulins on oil bodies displaying 

Protein A 

10 Having produced light and heavy chain multimeric immunoglobulin 

complexes in one transgenic plant line and the display of Protein A on oil bodies 
through the oil body targeting of a Protein A - oleosin fusion protein in a second 
plant line, at least two embodiments can be used to capture the immunoglobulin 
complex on the Protein A oil bodies. In the first embodiment, transgenic seed 
1 5 from both the immunoglobulin and the Protein A - oleosin expression lines is 
combined in an optimum ratio and then ground together such that the disrupted 
material, from both seed lines would be combined in the same extract. The 
combined seed extracts are mixed and/or incubated under conditions that allow 
maximum recovery of the immunoglobulin by Protein A. The oil body fraction is 
20 separated using standard phase separation techniques (e.g. centrifugation). The 
recovered oil body fraction contains both native oil bodies, from the 
immunoglobulin expression line, and transgenic Protein A oil bodies from the 
Protein A - oleosin expression line. 

In a second embodiment, the plant lines expressing the immunoglobulin 
25 complex and the Protein A - oleosin fusion are crossed and individual plant lines 
expressing both components are identified and propagated. In this approach, the 
immunoglobulin complex and the Protein A - oleosin fusion are produced in 
different cellular compartments of the same plant seed cell. Seed from the 
double transgenic line is ground to disrupt the cellular material and mix the 
30 contents of all cellular compartments, including combining the immunoglobulin in 
the extracellular compartment and the Protein A - oleosin on the oil body in the 
cytosolic compartment. The material is mixed and/or incubated under conditions 
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to allow maximum recovery of the immunoglobulin by Protein A, and the oil body 
fraction is separated by phase separation techniques. The recovered oil body 
fraction contains the displayed Protein A and the capture immunoglobulin 
complex. 

Example 6 

Production of assembled multimeric-immunoglobulin-complexes as fusions with 

oil body targeting domains. 

Individual polypeptides are produced as a fusion protein with oil body 
targeting sequences (e.g. oleosin) for display on oil bodies. It has been found 
that the individual subunits of naturally associating heterodimeric proteins can be 
co-produced as individual oleosin fusions and still associate as an active 
heterodimer on the surface of the oil body. In this example, the heterodimer is 
the light and heavy chain subunits, or derived portions thereof, of an 
immunoglobulin complex. 
1 5 Production of an immunoglobulin Fab complex on oil bodies. 

The mature light chain sequence, lacking the secretion signal sequence, is 
attached as an in-frame N-terminal fusion to an oleosin sequence. This fusion 
sequence is assembled into a seed-specific expression cassette consisting of a 
seed-specific promoter sequence, the light chain - oleosin fusion sequence, and 
20 a transcriptional terminator sequence. The expression cassette is inserted 

between the left and right border markers, adjacent to a plant selectable marker 
cassette, of a transformation vector. The transformation vector, in 
Agrobacterium, is used to infect plants and generate transgenic plants. 

An equivalent construct for the heavy chain subunit, comprising the 
25 variable and constant heavy chain domains, is also attached as an in-frame 
fusion to oleosin and assembled into an expression cassette for seed-specific 
expression. The expression cassette can be a part of a separate transformation 
vector for the generation of a separate transgenic line, or the heavy chain 
expression cassette can be combined together with the light chain cassette into 
30 a single transformation vector. If light and heavy chain expression cassettes are 
transformed into plants on separate transformation vectors, the individual plant 
lines are crossed to create a single line expressing both heterodimer subunit - 
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oleosin fusions in the same plant cell. Seed from the double transgenic line, or a 
single transgenic line generated from the dual expression vector, is extracted to 
isolate oil bodies. The seed material is ground to release the cellular contents 
and oil bodies are isolated by phase separation. The targeting of both light and 
5 heavy chain sequence to oil bodies, as oleosin fusions, allows the association of 
the immunoglobulin complex on the surface of the oil body. 

Similar configurations, using the entire heavy chain sequence in 
combination with the entire light chain sequence, or using the variable domains 
from both the light and heavy chain sequences, are constructed to assemble 
10 different types of heteromultimeric-immunoglobulin-complexes (e.g., 

heterodimers) on the surface of oil bodies. 

The present invention should therefore not be seen as limited to the 
particular embodiments described herein, but rather, it should be understood that 
the present invention has wide applicability with respect to protein expression 
15 generally. Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 
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SUMMARY OF SEQUENCES 

SEQ ID NOs:l~4 set forth primers which were synthesized for the 
isolation of the thioredoxin h (Trxh) and thioredoxin reductase genes from 
Arabidopsis, as described in Example 1 . 
5 SEQ ID NOs:5-7 set forth primers which were designed to code for a 

specific linker peptide between thioredoxin reductase and thioredoxin proteins, 

as described in Example 2. 

SEQ ID NOs:8, 10 and 1 1 set forth the nucleotide sequence and the 
deduced amino acid sequence of the NADPH thioredoxin reductase sequence 

10 isolated herein as described in Example 1. 

SEQ ID NOs:9 and 11, respectively, set forth the nucleotide sequence of 
the published NADPH thioredoxin reductase sequence (ATTHIREDB) and the 

deduced amino acid sequence. 

SEQ ID NO:12 sets forth the deduced amino acid sequence of the 
16 published NADPH thioredoxin reductase sequence. 

SEQ ID NO: 13 sets forth the deduced amino acid sequence of the NADPH 
reductase sequence isolated in this report. 

SEQ ID NOs:14 and 15 set forth the nucleotide sequence of the phaseolin 
promoter-Arab/dopsfs Trxh-phaseolin terminator sequence described in Example 
20 2, and the deduced amino acid sequence. The Trxh coding sequence and its 

deduced amino acid sequence is indicated. The phaseolin promoter corresponds 
to nucleotide 6-1554, and the phaseolin terminator corresponds to nucleotide 
sequence 1905-3124. The promoter was furnished with a Pstl site (nt 1-6) and 
the terminator was furnished with a Hindill site (nt 1898-1903) and a Kpnl site 
25 (nt 3124-3129) to facilitate cloning. 

SEQ ID NOs:16, 17 and 18 set forth the nucleotide sequence of the 
phaseolin promoter-oleosin Trxh-phaseolin terminator sequence described in 
Example 2, and the deduced amino acid sequences. The oleosin-Trxh coding 
sequence and the deduced amino acid sequences are indicated in SEQ ID NO:16. 
30 As in SEQ ID NO: 14, the phaseolin promoter corresponds to nucleotide 6-1554. 
The sequence encoding oleosin corresponds to nt 1555-2313, the intron in this 
sequence {nt 1908-2147) is indicated in italics. The Trxh coding sequence 
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corresponds to nt 2314-2658. The phaseolin terminator corresponds to 
nucleotide sequence 2664-3884. 

SEQ ID 1X10:19, 20 and 21 set forth the nucleotide sequence of the 
phaseolin promoter - Trxh oleosin-phaseolin terminator sequence as described in 
5 Example 2, and the deduced amino acid sequences. The Trxh oleosin- coding 
sequence and its deduced amino acid sequences are indicated in SEQ ID NO:19. 
As in SEQ ID NOs:14 and 16, the phaseolin promoter corresponds to nucleotide 
6-1554. The Trxh coding sequence corresponds to nt 1555-1896. The 
sequence encoding oleosin corresponds to nt 1897-2658, the Intron in this 
10 sequence (nt 2250-2489) is indicated in italics. The phaseolin terminator 
corresponds to nucleotide sequence 2664-3884. 

SEQ ID NO:22 and 23 set forth the nucleotide sequence of the phaseolin 
promoter-thioredoxin-reductase-phaseolin terminator sequence as described in 
Example 2, and the deduced amino acid sequence. The thioredoxin-reductase 
15 coding sequence and its deduced amino acid sequence is indicated in SEQ ID 
NO:22. The phaseolin promoter corresponds to nucleotide 6-1554. The 
thioredoxin-reductase coding sequence corresponds to nt 1555-2556 and the 
deduced amino acid is set forth in SEQ ID NO:23. The phaseolin terminator 
corresponds to nucleotide sequence 2563-3782. 
20 SEQ ID NOs:24 r 25 and 26 show the nucleotide sequence of the 

phaseolin promoter-oleosin thioredoxin-reductase-phaseolin terminator sequence 
as described in Example 2, and the deduced amino acid sequences. The oleosin- 
thioredoxin-reductase coding sequence and its deduced amino acid sequence is 
indicated. The phaseolin promoter corresponds to nucleotide 6-1554. The 
25 sequence encoding oleosin corresponds to nt 1555-2313, the intron in this 
sequence (nt 1980-2147) is indicated in italics. The thioredoxin-reductase 
coding sequence corresponds to nt 2314-3315. The phaseolin terminator 
corresponds to nucleotide sequence 3321-4540. 

SEQ ID NOs:27, 28 and 29 show the nucleotide sequence of the 
30 phaseolin promoter - thioredoxin-reductase oleosin - phaseolin terminator 

sequence as described in Example 2, and the deduced amino acid sequences. 
The thioredoxin-reductase coding sequence and its deduced amino acid 
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sequence is indicated. The phaseolin promoter corresponds to nucleotide 6- 
1554. The thioredoxin-reductase coding sequence corresponds to nt 1555- 
2553. The sequence encoding oleosin corresponds to nt 2554-3315, the intron 
in this sequence (nt 2751-3146) is indicated in italics. The phaseolin terminator 
5 corresponds to nucleotide sequence 3321-4540. 

SEQ ID NO:30, 31 and 32 show the sequence of the phaseolin promoter - 
oleosin - Mlep thioredoxin-reductase/thioredoxin -phaseolin terminator sequence 
as described in Example 2, and the deduced amino acid sequences. The oleosin- 
Mlep thioredoxin-reductase/thioredoxin coding sequence and its deduced amino 
10 acid sequence is indicated. The phaseolin promoter corresponds to nucleotide 6- 
1554. The sequence encoding oleosin corresponds to nt 1555-2313, the intron 
in this sequence (nt) is indicated in italics. The Mlep thioredoxin- 
reductase/thioredoxin coding sequence corresponds to nt 2314-3690. The 
phaseolin terminator corresponds to nucleotide sequence 3698-4917. 
15 SEQ ID NOs:33, 34 and 35 set forth the nucleotide sequence of the 

phaseolin promoter-oleosin-thioredoxin-reductase-linker-thioredoxin-phaseolin 
terminator region of P SBS2542, and the deduced amino acid sequences. The 
deduced amino acid sequence of oleosin-thioredoxin-reductase-linker-thioredoxin 
is also shown in SEQ ID NO:33. Amino acids representing oleosin are set forth 
20 at positions 1-173, those amino acids representing thioredoxin-reductase are set 
forth at positions 1 74-501 , those amino acids representing the linker or spacer 
peptide are set forth at positions 501-524, and those representing thioredoxin 

are set forth at positions 525-636. 

SEQ ID NOs:38 and 39 set forth the nucleotide sequence of Arabidopsis 
25 Thaliana Thioredoxin h (Trx h 1) and the encoded protein, respectively. 

SEQ ID N0s:40 and 41 set forth the nucleotide sequence of Arabidopsis 
Thaliana Thioredoxin Reductase (NTR1) and the encoded protein, respectively. 

SEQ ID NOs:42 and 43 set forth the nucleotide sequence of E. Coli 
Thioredoxin (TrxA) and the encoded protein, respectively. 
30 SEQ ID NOs:44 and 45, set forth the nucleotide sequence of E. Coli 

Thioredoxin Reductase and the encoded protein, respectively. 
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SEQ ID NOs:46 and 47 set forth the nucleotide sequence of Human 
Thioredoxin and the encoded protein, respectively. 

SEQ ID NOs:48 and 49, set forth the nucleotide sequence of Human 
Thioredoxin Reductase and the encoded protein, respectively. 

SEQ ID NOs:50 and 51, respectively, set forth the nucleotide sequence of 
Mycobacterium leprae Thioredoxin-Thioredoxin Reductase and the encoded 
protein, respectively. 

SEQ ID NOs:52-313 are described in Table 5. 

TABLE 5 



SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


PLANT THIOREDOXINS 


Thioredoxin f-type 


52 


(Q9XFH8) Thioredoxin F-type 1 , chloroplast precursor (TRX- 
F1). - Arabidopsis thaliana (Mouse-ear cress) 


53 


(Q9XFH9) Thioredoxin F-type 2, chloroplast precursor (TRX- 
F2). {GENE: AT5G16400 OR MQK4.13} - Arabidopsis thaliana 
(Mouse-ear cress) 


54 


(048897) Thioredoxin F-type, chloroplast precursor (TRX-F). 
{GENE: TRXF} - Brassica napus (Rape) 


55 


(081332) Thioredoxin F-type, chloroplast precursor (TRX-F). - 
Mesembryanthemum crystallinum (Common ice plant) 


56 


(P29450) Thioredoxin F-type, chloroplast precursor (TRX-F). - 
Pisum sativum (Garden pea) 


57 


(P09856) Thioredoxin F-type, chloroplast precursor (TRX-F). - 
Spinacia oleracea (Spinach) 


Thioredoxin m-type 


58 


(P06544) Thioredoxin 1 (TRX-1) (Thioredoxin M). {GENE: 
TRXA} - Anabaena sp. (strain PCC 7119) 


59 


(048737) Thioredoxin M-type 1 , chloroplast precursor (TRX- 
M1). {GENE: AT1G03680 OR F21B7J7 OR F21B7.28} - 
Arabidopsis thaliana (Mouse-ear cress) 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


60 I 


(Q9SEU8) Thioredoxin M-type 2, chloroplast precursor (TRX- 
M2). {GENE: AT4G03520 OR F9H3.15 OR T5L23.1} - 
Arabidopsis thaliana (Mouse-ear cress) 


61 


(Q9SEU7) Thioredoxin M-type 3, chloroplast precursor (TRX- 
M3). {GENE: AT2G15570 OR F9013.12} - Arabidopsis 


i 62 I 


(Q9SEU6) Thioredoxin M-type 4, chloroplast precursor (TRX- 
iv/izi\ - ArahiHnn<5i<? thaliana (Mouse-ear cress) 


| 63 J 


(Q9XGS0) Thioredoxin M-type, chloroplast precursor (TRX-M). 
- Brassica napus (Rape) 


I 64 I 


(P23400) Thioredoxin M-type, chloroplast precursor (TRX-M) 
(Thioredoxin CH2). {GENE: TRXM} - Chlamydomonas 
reinnaraiii 


I 65 I 


(Q41864) Thioredoxin M-type, chloroplast precursor (TRX-M). 
fr^PMP* TRM1\ - 7ea mavs (Maize) 


I 66 


1 (Q9ZP20) Thioredoxin M-type, chloroplast precursor (TRX-M). 
- Oryza sativa (Rice) 


67 


(P48384) Thioredoxin M-type, chloroplast precursor (TRX-M). - 
| Pisum sativum (Garden pea) 


] 68 


(P07591) Thioredoxin M-type, chloroplast precursor (TRX-M). - 
| Spinacia oleracea (Spinach) 


I 69 


1 (Q9ZP21) Thioredoxin M-type, chloroplast precursor (TRX-M). 
- Triticum aestivum (Wheat) 


I 70 


(P1 2243) Thioredoxin 1 (TRX-1) (Thioredoxin M). {GENE: 
TRXA OR TRXM} - Synechococcus sp. (strain PCC 7942) 
I (Anacystis nidulans R2) 


I 71 


(P37395) Thioredoxin. {GENE: TRXA OR TRX) - Cyanidium 
caldarium [Chloroplast] 


I 72 


(022022) Thioredoxin. {GENE: TRXA OR TRXM} - 
Cyanidioschyzon merolae [Chloroplast] 


73 


(P50338) Thioredoxin. {GENE: TRXA} - Griffithsia pacifica 
[Chloroplast] 



WO 02/50289 



PCT/US01/50240 



-107- 



10 



SEQ. ID 
NO.. 



74 



75 



76 



77 



78 



79 



80 



81 



82 



83 



84 



85 



86 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 

(P50254) Thioredoxin. {GENE: TRXA} - Porphyra yezoensis 
[Chloroplast] 



(P51225) Thioredoxin. {GENE: TRXA} - Porphyra purpurea 
[Chloroplast] 



Thioredoxin h-type 



(P29448) Thioredoxin H-type 1 (TRX-H-1). {GENE: TRX1 OR 
AT3G51030 OR F24M12.70} - Arabidopsis thaliana {Mouse- 
ear cress) 



(P20857) Thioredoxin 2 (TRX-2). {GENE: TRXB} - Anabaena 
sp. (strain PCC 7120) 



(Q42388) Thioredoxin H-type 1 (TRX-H-1) (Pollen coat 
protein). {GENE: THL-1 OR BOPC17} - Brassica napus (Rape), 
Brassica oleracea (Cauliflower) _ 



(P29449) Thioredoxin H-type 1 (TRX-H1 ). - Nicotiana tabacum 
(Common tobacco) 

(Q38879) Thioredoxin H-type 2 (TRX-H-2). {GENE: TRX2 OR 
AT5G39950 OR MYH19.14} - Arabidopsis thaliana (Mouse-ear 

cress) 

(Q39362) Thioredoxin H-type 2 (TRX-H-2). {GENE: THL-2} - 
Brassica napus (Rape) 
(Q07090) Thioredoxin H-type 2 (TRX-H2). - Nicotiana tabacum 
(Common tobacco) 



(Q42403) Thioredoxin H-type 3 (TRX-H-3). {GENE: TRX3 OR 
AT5G42980 OR MBD2.18} - Arabidopsis thaliana (Mouse-ear 

cress) 



(Q39239) Thioredoxin H-type 4 (TRX-H-4). {GENE: TRX4} - 
Arabido psis thaliana (Mouse-ear cress) 

(Q39241) Thioredoxin H-type 5 (TRX-H-5). {GENE: TRX5} - 
Arabidopsis thaliana (Mouse-ear cress) 



(064432) Thioredoxin H-type (TRX-H). {GENE: PEC-2} - 
Brassica rapa (Turnip) 
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SEQ. ID 
NO. 



SWISS PROTEIN IDENTIFIER 

(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 



87 



88 



89 



90 



91 



92 



93 



(P80028) Thioredoxin H-type (TRX-H) (Thioredoxin CH1). 
{GENE: TRXH} - Chlamydomonas reinhardtii 



(Q96419) Thioredoxin H-type (TRX-H). - Fagopyrum 
esculentum (Common buckwheat) 



(Q42443) Thioredoxin H-type (TRX-H) (Phloem sap 13 kDa 
protein-1). - Oryza sativa (Rice) 



(065049) Thioredoxin H-type (TRX-H). {GENE: SB09} - Picea 
mariana (Black spruce) 

(Q43636) Thioredoxin H-type (TRX-H). - Ricinus communis 
(C astor bean) 

(064394) Thioredoxin H-type (TRX-H) (TrxTa). - Triticum 
aestivum (Wheat) 



(P29429) Thioredoxin. - Emericella nidulans (Aspergillus 
nidulans) 



VIRUSES, BACTERI A AND FUNGI THIOREDOXINS 

94 



(P80579) Thioredoxin (TRX). {GENE: TRXA} - Aticyclobacillus 
acidocaldarius (Bacillus acidocaldarius) 



95 



(028137) Thioredoxin. {GENE: AF21 45} - Archaeoglobus 
fulgidus ' - 



96 



(P14949) Thioredoxin (TRX). {GENE: TRXA OR TRX} - Bacillus 
subtilis 



97 
98 

99 



100 



(P0 0276) Thioredoxin. {GENE: NRDC} - Bacteriophage T4 

(O51088) Thioredoxin (TRX). {GENE: TRXA OR BB0061} - 
Borr elia burgdorferi (Lyme disease spirochete) 

(P57653) Thioredoxin (TRX). {GENE: TRXA OR BU597} - 
Buchnera aphidicola (subsp. Acyrthosiphon pisum) 
(Acyrthosip hon pisum symbiotic bacterium) 

(051890) Thioredoxin (TRX). {GENE: TRXA} - Buchnera 
aphidicola (subsp. Schizaphis graminum) 



101 



(P10472) Thioredoxin (TRX). {GENE: TRXA} - Chlorobium 
limicoia f.sp. thiosulfatophilum 
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SEQ. ID 
NO. 



102 



103 



104 



105 

106 
107 
108 
109 



110 



111 



112 



113 



114 



115 



116 



117 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 



(Q9PJK3) Thioredoxin (TRX). {GENE: TRXA OR TC0826} - 
C hlamydia muridarum 

(Q9Z7P5) Thioredoxin (TRX). {GENE: TRXA OR CPN0659 OR 
CP0088} - Chlamydia pneumoniae (Chlamydophila 
pneumoniae) 



(P52227) Thioredoxin (TRX). {GENE: TRXA} - Chlamydia 
psitta ci (Chlamydophila psittaci) 

(084544) Thioredoxin (TRX). {GENE: TRXA OR CT539} - 
Chlamydi a trachomatis 

(P00275) Thioredoxin C-1 . - Corynebacterium nephridii 



(P07887) Thioredoxin C-2, - Corynebacterium nephridii 

(P52228) Th ioredoxin C-3. - Corynebacterium nephridii 

(P09857) Thioredoxin (TRX). {GENE: TRXA} - Chromatium 
vinosum . 



(P21 609) Thioredoxin (TRX). {GENE: TRXA} - Clostridium 
iitorale (Bacterium W6) 



(P81108) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
C lostridium sporogenes 

(P81109) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
Clostridium sticklandii 



(Q9UW02) Thioredoxin (Allergen Cop c 2). - Coprinus comatus 
(Shaggy mane) 

(P29445) Thioredoxin 1 . {GENE: TRXA OR TRX1} - 
Dictyosteiium discoideum (Slime mold) 



(P29446) Thioredoxin 2 (Fragment). {GENE: TRXB OR TRX2} - 
Dictyosteiium discoideum (Slime mold) 



(P29447) Thioredoxin 3. {GENE: TRXC OR TRX3} - 
Dictyosteiium discoideum (Slime mold) 



(P00274) Thioredoxin 1 (TRX1) (TRX). {GENE: TRXA OR 
TSNC OR FIPA OR B3781} - Escherichia coh f Salmonella 
typhimurium 
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SEQ. ID 
NO. 



118 



119 



120 



121 



122 



123 



124 



125 



126 



127 



128 



129 



130 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 



EX AMPLES OF REDOX PROTEINS 

(P52232) Thioredoxin-like protein SLR0233. {GENE: SLR0233} 
- Synechocystis sp. (strain PCC 6803) 



(P33636) Thioredoxin 2 (Trx2). {GENE: TRXC OR B2582 OR 
Z3867 OR ECS3448} - Escherichia coli, Escherichia coli 
0157:H7 



(P21 610) Thioredoxin (TRX). {GENE: TRXA} - Eubacterium 
acidaminophiium 



(P43785) Thioredoxin (TRX). {GENE: TRXA OR TRXM OR 
H10084} - Haemophilus influenzae 



(P43787) Thioredoxin-like protein HI1115. {GENE: HI1115} - 
Haemophilus influenzae - 



(P56430) Thioredoxin (TRX). {GENE: TRXA OR HP0824 OR 
JHP0763} - Helicobacter pylori (Campylobacter pylori), 
Helicobacter pylori J99 (Campylobacter pylori J99) 



(Q9S386) Thioredoxin (EC 1.6.4.5) {GENE:TRXA} - Listeria 
m onocytogenes 

(Q57755) Thioredoxin. {GENE: TRX OR MJ0307} - 
Methanoc occus jannaschii 

(P47370) Thioredoxin (TRX). {GENE: TRXA OR TRX OR 
MG1 24} - Mycoplasma genitalium 

(P46843) Bifunctional thioredoxin-reductase/thioredoxin 
[Includes: Thioredoxin-reductase (EC 1.6.4.5) (TRXR); 
Thioredoxin]. {GENE: TRXB/A OR TRX OR ML2703} - 
Mycobact erium leprae 

(P75512) Thioredoxin (TRX). {GENE: TRXA OR TRX OR 
MPN263 OR MP570} - Mycoplasma pneumoniae 



(O30974) Thioredoxin (TRX). {GENE: TRXA} - Mycobacterium 
s megmatis 

(P52229) Thioredoxin (TRX) (MPT46). {GENE: TRXA OR TRX 
OR TRXC OR RV3914 OR MT4033 OR MTV028.05} - 
Mycobacterium tuberculosis 
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NO. 



10 



132 



133 



134 



135 



136 



137 



138 



139 



140 



141 



142 



143 



SWISS PROTEIN IDENTIFIER 

(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 



(P34723) Thioredoxin. {GENE: TRXA} - Penicillium 
chrysogenum 

(Q9X2T1) Thioredoxin (TRX). {GENE: TRXA OR TRX OR 
PA5240} - Pseud omonas aeruginosa 

(P10473) Thioredoxin (TRX). {GENE: TRXA} - Rhodospirillum 
rubrum — 



(P08058) Thioredoxin (TRX). {GENE: TRXA} - Rhodobacter 
sphaeroides (Rhodopseudomonas sphaeroides) 



(Q9ZEE0) Thioredoxin (TRX). {GENE: TRXA OR RP002} - 
Rickettsia prowazekii < „ 



(P33791) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
Streptomyces aureofaciens 



(P52230) Thioredoxin (TRX). {GENE: TRXA OR SCH24.11C} - 
Streptomyces coelicoior _ 



(Q05739) Thioredoxin (TRX). {GENE: TRXA} - Streptomyces 
clavuligerus _ 

(P52231) Thioredoxin (TRX). {GENE: TRXA OR SLR0623} - 
Synechocystis sp. (strain PCC 6803) 



(P73263) Thioredoxin-like protein SLR1139. {GENE: SLR1139} 
- Synechocystis sp. (strain PCC 6803) 



144 



145 



(P52233) Thioredoxin (TRX). {GENE: TRXA} - Thiobacillus 
ferrooxidans 

(P961 32) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
Thiocapsa roseopersicina 



(P81 110) Thioredoxin (TRX) (Fragment). {GENE: TRXA} - 
Tissierella creatinophila 

(083889) Thioredoxin (TRX). {GENE: TRXA OR TP0919} - 
Treponema pallidum 
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SEQ. ID 
NO. 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 



ANIMAL THI OREDOXIN 

146 I (097680) Thioredoxin. {GENE: TXN} - Bos taurus (Bovine) 

147 



148 



149 

150 
151 

152 



153 



154 



155 



156 



157 



158 



160 



(Q95108) Thioredoxin, mitochondrial precursor (MT-TRX). 
{GENE: TXN2} - Bos taurus (Bovine) 



(Q09433) Thioredoxin. {GENE: B0228.5} - Caenorhabditis 
elegans ' ' _ _ 



(P99505) Thioredoxin (Fragment). {GENE: TXN} - Canis 
familiaris (Dog 

(P0862 9) Thioredoxin. {GENE: TXN} - Gallus gallus (Chicken) 

(P47938) Thioredoxin (Deadhead protein). {GENE: DHD OR 
CG4193} - Drosophila melanogaster (Fruit fly) 



(P1 0599) Thioredoxin (ATL-derived factor) (ADF) (Surface 
associated sulphydryl protein) (SASP). {GENE: TXN OR TRDX 
OR TRX} - Homo sapiens (Human) 



(Q99757) Thioredoxin, mitochondrial precursor (MT-TRX). 
{GENE: TXN2} - Homo sapiens (Human) m 



(P29451) Thioredoxin. {GENE: TXN} - Macaca mulatta (Rhesus 
macaque) _ 



(P1 0639) Thioredoxin (ATL-derived factor) (ADF). {GENE: 
TXN} - Mus musculus (Mouse) 

(P97493) Thioredoxin, mitochondrial precursor (MT-TRX) 
{GENE: TXN2} - Mus musculus (Mouse) 



(P82460) Thioredoxin (Fragment). {GENE: TXN} - Sus scrofa 

(Pig) 

(P08628) Thioredoxin. {GENE: TXN} - Oryctolagus cuniculus 
(Rabbit) 



759 I (P11232) Thioredoxin. {GENE: TXN} - Rattus norvegicus (Rat) 



(P9761 5) Thioredoxin, mitochondrial precursor (MT-TRX) 
{GENE: TXN2 OR TRX2} - Rattus norvegicus (Rat) 



761 1 (P50413) Thioredoxin. {GENE: TXN} - Ovis aries (Sheep) 



PLANTS THIOREPOXIN-LIKE PROTEINS 
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SEQ. ID 
NO. 



162 



163 



164 



165 



166 



167 



168 



169 



170 



171 



172 



173 



174 



175 



176 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 



(023166) THIOL-DISULFIDE INTERCHANGE LIKE PROTEIN 
(THIOREDOXIN-LIKE PROTEIN) {GENE:C7A1 0.1 60 OR 
AT4G37200 OR HCF164} - Arabidopsis thaliana (Mouse-ear 

cress) 

(Q9C9Y6) Thioredoxin-like protein {GENE:F1 701 4.1 8} - 
Arabidopsis thaliana (Mouse-ear cress) 



(Q9FYD5) Thioredoxin-like protein {GENE:F21 E1_1 80} - 
Arabidop sis thaliana (Mouse-ear cress) 

(Q38878) THIOREDOXIN-LIKE PROTEIN {GENE:TRX6 OR 
T7D17.3} - Arabidopsis thaliana (Mouse-ear cress) 



(Q9LVI2) Thioredoxin-like protein - Arabidopsis thaliana 
(Mouse-ear cress) __ 



(Q9SCN9) Thioredoxin-like protein {GENE:T4D2.1 50} - 
Arabidopsis thaliana (Mouse-ear cress) 



(Q9SRD7) Thioredoxin-like protein, 49720-48645 
(GENE:F28016.13} - Arabidopsis thaliana (Mouse-ear cress) 



(Q9SU84) THIOREDOXIN-LIKE PROTEIN {GENE:T1 6L4.1 80 OR 
AT4G29670} - Arabidopsis thaliana (Mouse-ear cress) 



(Q9SWG6) Thioredoxin-like protein {GENE:TRX} - Hordeum 
bulbosum 

(Q9SWG4) Thioredoxin-like protein {GENE:TRX} - Lolium 
perenne (Perennial ryegrass) 



(Q9AS75) Thioredoxin-like protein {GENE:P0028E1 0.1 7} - 
Oryza sativa (Rice) , 



(O04002) CDSP32 protein (Chloroplast Drought-induced Stress 
Protein of 32kDa) - Solanum tuberosum (Potato) 



(Q9SWG5) Thioredoxin-like protein {GENE:TRX} - Secale 
cereale (Rye) _ 



(Q9SP36) Thioredoxin-like protein (Fragment) {GENE:TRX} - 
Secaie cereale (Rye) 



(Q9U515) Thioredoxin-like protein - Manduca sexta (Tobacco 
hawkmoth) (Tobacco hornworm) t 
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SEQ. ID 
NO. 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 




177 



178 



(P43221) Thiohdisulfide interchange protein tlpA (Cytochrome 
c biogenesis protein tlpA). {GENE: TLPA) - Bradyrhizobium 
japonicum 

(P43787) Thioredoxin-like protein HI1115. {GENE: HI1115} - 
Haemophilus influenzae 



179 



(Q9GUP7) Thioredoxin-like protein {GENE:TRXLP} - Leishmania 
major _ 



180 I (Q9UV H0) Thioredoxin-like protein - Mortierella alpina 

181 I (P95355) Thioredoxin-like protein - Neisseria gonorrhoeae 

182 



183 



(Q98G37) Thioredoxin-like protein {GENE:MLL3505} - 
Rhizobiu m loti (Mesorhizobium loti) 

(P36893) Thiohdisulfide interchange protein helX precursor 
(Cytochrome c biogenesis protein helX). {GENE: HELX} - 
Rhodobacter capsulatus (Rhodopseudomonas capsulata) 



184 



185 



(P52232) Thioredoxin-like protein SLR0233. {GENE: SLR0233} 

- Synech ocystis sp. (strain PCC 6803) 

(P73263) Thioredoxin-like protein SLR1139. {GENE: SLR1139} 

- Synechocystis sp. (strain PCC 6803) 



186 



(Q9USR1) Thioredoxin-like protein {GENE:SPBC577.08C} - 
Schizosaccharomyces pombe (Fission yeast) 



187 



(Q9R788) Thioredoxin {GENE.'TPTRX} - Treponema pallidum 



ANIMALS THIOREDOXIN-LIKE PROTEINS 



188 



189 



190 



(Q9UAV4) F46E10.9 PROTEIN (THIOREDOXIN-LIKE PROTEIN 
DPY-1 1) {GENE:F46E10.9 OR DPY-1 1} - Caenorhabditis 
elegans 



«19N2K6) Thioredoxin-like protein (Y54E10A.3 protein) 
(Thioredoxin-like protein TXL) {GENE:TXL OR Y54E10A.3} - 
Caenorhabditis elegans 



(Q9VRP3) THIOREDOXIN-LIKE PROTEIN TXL (CG5495 
• PROTEIN) {GENE:TXL OR CG5495} - Drosophila melanogaster 
(Fruit fly) : 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


191 


(043396) Thioredoxin-like protein {32 kDa thioredoxin-related 
protein). {GENE: TXNL OR TRP32 OR TXL} - Homo sapiens 
(Human) 


192 


(O76003) Thioredoxin-like protein - Homo sapiens (Human) 


193 


(Q9S753) THIOREDOXIN-LIKE PROTEIN {GENEiTRX} - Phalaris 
coerulescens 


194 


(O77404) TRYPAREDOXIN - Trypanosoma brucei brucei 


PLANT THIOREDOXIN-REDUCTASES i 



10 



195 v _ 

dependent thioredoxin-reductase 1) (NTR 1). {GENE: NTR1 OR 
AT4G35460 OR F15J1.30} - Arabidopsis thaliana (Mouse-ear 
cress) 

196 (Q39242) Thioredoxin-reductase 2 (EC 1.6.4.5) (NADPH- 
dependent thioredoxin-reductase 2) (NTR 2). {GENE: NTR 2 OR 
AT2G17420 OR F5J6.18) - Arabidopsis thaliana (Mouse-ear 
cress) 

VIRUSES, BA CTERIA AND FUNGI THIOREDOXIN-REDUCTASES 

197 (066790) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR AQ_500) - Aquifex aeolicus 



198 



(P80880) Thioredoxin-reductase (EC 1.6.4.5) (TRXR) (General 
stress protein 35) (GSP35). {GENE: TRXB} - Bacillus subtilis 



199 



(P94284) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR BB0515} - Borrelia burgdorferi (Lyme disease 
spirochete) 



200 



(PB7399) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR BU314} - Buchnera aphidicola (subsp. Acyrthosiphon 
pisum) (Acyrthosiphon pisum symbiotic bacterium) 



201 



(P8 1433) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB} - Buchnera aphidicola (subsp. Schizaphis graminum) 



202 



(Q9PKT7) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR TC0375} - Chlamydia muridarum 
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SEQ. ID 
NO. 



203 



204 



205 



10 



206 



207 



208 



209 



210 



211 



212 



213 



214 



215 



2'16 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 

(Q9Z8M4) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR CPN0314 OR CP0444} - Chlamydia pneumoniae 
(Chlamydophila pneumoniae) 



(0841 01) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR CT099 } - Chlamydia trachomatis 

(P5221 3) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB} - Clostridium litoraie (Bacterium W6) 



(P3991 6) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB} - Coxiella burnetii 



(P09625) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR B0888 OR Z1232 OR ECS0973} - Escherichia coll, 
Escherichia coli Q157:H7 

(P50971) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB} - Eubacterium acidaminophilum 



(P43788) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR H11 158} - Haemophilus influenzae 



(Q9ZL18) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR JHP0764} - Helicobacter pylori J99 (Campylobacter 
pylori J99) 

(P56431) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR HP0825} - Helicobacter pylori (Campylobacter pylori) 



(032823) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR LM Q2478} - Listeria monocytogenes 

(P47348) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR MG102} - Mycoplasma genitalium 



(P46843) Bifunctional thioredoxin-reductase/thioredoxin 
[Includes: Thioredoxin-reductase (EC 1.6.4.5) (TRXR); 
Thioredoxin]. {GENE: TRXB/A OR TRX OR ML2703} - 
Mycobacterium leprae 



(P75531) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR MPN24 0 OR MP591} - Mycoplasma pneumoniae 

(O30973) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB} - Mycobacterium smegmatis 
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SEQ. ID 
NO. 



217 



218 



219 



220 



221 



222 



223 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 

EXAMPLES OF REDOX PROTEINS 



(P52214) Thioredoxin-reductase (EC 1.6.4.5) (TRXR) (TR). 
{GENE: TRXB OR RV3913 OR MT4032 OR MTV028.04} - 
Mycobacterium tuberculosis - 



(P51 978) Thioredoxin-reductase (EC 1.6.4.5). {GENE: CYS-9} 
- Neurospora crassa 



(P43496) Thioredoxin-reductase (EC 1.6.4.5). {GENE: TRXB} - 
Penicillium chrysogenum . 



(Q9ZD97) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR RP445} - Rickettsia prowazekii 



(Q92375) Thioredoxin-reductase (EC 1.6.4.5). {GENE: 
SPBC3F6.03} - Schizosaccharomyces pombe (Fission yeast) 



(Q05741) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB} - S treptomyces clavuligerus 

(P52215) Thioredoxin-reductase (EC 1.6.4.5} (TRXR). {GENE: 
TRXB OR SCH24.12} - Streptomyces coelicolor 



224 



225 



10 



226 



(083790) Thioredoxin-reductase (EC 1.6.4.5) (TRXR). {GENE: 
TRXB OR TP0814} - Treponema pallidum 



(P80892) Thioredoxin-reductase (EC 1.6.4.5) (TRXR) 
(Fragment). {GENE: TRXB} - Vibrio fischeri 



(P29509) Thioredoxin-reductase 1 (EC 1.6.4.5). {GENE: TRR1 
OR YDR353W OR D9476.5} - Saccharomyces cerevisiae 
(Baker's yeast) 



15 



227 



(P38816) Thioredoxin-reductase 2, mitochondrial precursor (EC 
1.6.4.5). {GENE: TRR2 OR YHR106W} - Saccharomyces 
cerevisiae (Baker's yeast) 



228 



229 



230 



(062768) Thioredoxin-reductase (EC 1.6.4.5). {GENE: 
TXNRD1} - Bos taurus (Bovine) 



(Q1 7745) Thioredoxin-reductase (EC 1.6.4.5). {GENE: 
C06G3.7} - Caenorhabditis elegans 



(Q1 6881) Thioredoxin-reductase (EC 1.6.4.5). {GENE: 
TXNRD1 } - Homo sapiens (Human) 
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SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


I 231 


(Q25861) Thioredoxin-reductase (EC 1 .6.4.5) (TrxR). {GENE: 
TR OR GR} - Plasmodium falciparum (isolate FCH-5) 




Other thioredoxin-reductases 


II PLANTS THIOREDOXIN-REDUCTASES 


J 232 


(022229) Thioredoxin-reductase {GENE:AT2G41 680} - 
Arabidopsis thaliana (Mouse-ear cress) 


I 233 


(Q39951) NADPH thioredoxin-reductase (Fragment) - 
Helianthus annuus (Common sunflower) 



VIRUSES, BACTERIA AND FUNGI THIOREDOXIN-REDUCTASES 



234 



235 



(028718) THioredoxin-reductase (TRXB) {GENE:AF1 554} - 
Archaeo globus fulgidus 

(Q9K703) Thioredoxin-reductase (NADPH) (EC 1.6.4.5) 
(GENE:TRXB OR BH3571} - Bacillus halodurans 



236 



(Q9K7F3) Thioredoxin-reductase {GENE:BH3408} - Bacillus 
halodurans - 



10 



237 



(Q9KCZ0) Thioredoxin-reductase (GENE:BH1 429} - Bacillus 
halodurans . 



238 



(Q9KCZ1) Thioredoxin-reductase {GENE:BH1 428} - Bacillus 
halodurans . 



239 



(Q9PIY1) Thioredoxin-reductase (EC 1.6.4.5) {GENE:TRXB OR 
CJ0146} - Campylobacter jejuni _ < 



240 



241 



(Q9A4G3) Thioredoxin-reductase {GENE:CC287 1 } - 
Caulobacte r crescentus 

(Q97EM8) Thioredoxin-reductase {GENE:CAC3082} - 
Clostridium acetobutylicum 



15 



242 



(Q97IU2) Thioredoxin-reductase {GENE:CAC1 548} - 
Clostridium acetobutylicum 



243 



(Q9EV96) Thioredoxin-reductase {GENETRXB} - Clostridium 
sticklandii . 



244 



(Q9RSY7) THioredoxin-reductase {GENE:DR1 982} - 
Deinococcus radiodurans 
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245 



246 



247 



248 



249 



250 



251 



252 



10 



253 



254 



255 



256 



257 



15 



258 



259 



SWISS PROTEIN IDENTIFIER 

(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 



(030739) Thioredoxin-reductase (Fragment) - Enterococcus 
faecalis (St reptococcus faecalis) 

(054535) Thioredoxin-reductase {GENEzTRXB OR TRXB1_2 
OR VNG6452G OR TRXB1 J OR VNG6074G} - Halobactenum 
sp. (strain NRC-P [Plasmid pNRCIOQ, and Plasmid pNRC200] 

(P82854) Thioredoxin-reductase (EC 1.6.4.5) {GENE:TRXB2} - 
Halobacterium sp. (strain NRC-1) 



(Q9HN08) Thioredoxin-reductase {GENE:TXRB3 OR 
VNG2301G} - Halobacterium sp. (strain NRC-1) 



(025779) THioredoxin-reductase (TRXB) {GENE:HP1 1 64} - 
Helicoba cter pylori (Campylobacter pylori) 

(086255) Thioredoxin-reductase {GENE:TRXB} - Klebsiella 
oxytoca 



(Q9AEV9) Thioredoxin-reductase (Fragment) {GENE:TRXB} - 
Lactococcus lactis (subsp. lactis) (Streptococcus lactis) 



(Q9CF34) Thioredoxin-reductase (EC 1.6.4.5) {GENETRXB2} - 
Lactococcus lactis (subsp. lactis) (Streptococcus lactis) 



(Q9CH02) Thioredoxin-reductase (EC 1.6.4.5) {GENE:TRXB1 } 
- Lactococcus lactis (subsp. lactis) (Streptococcus lactis) 

(Q9ZFC8) Thioredoxin-reductase (Fragment) {GENE:TRXB} - 
Lactococcus lactis 



(032822) Hypothetical 39.7 kDa protein (Fragment) - Listeria 
monocytogenes - 



(O26804) THioredoxin-reductase {GENE:MTH708} - 
Methanothermobacter thermautotrophicus 



(P94397) Homologue of thioredoxin-reductase of Mycoplama 
genitalium {GENE:YCGT} - Bacillus subtilis 



(Q98PK9) THioredoxin-reductase (EC 1.6.4.5) 
{GENE:MYPU_7 1 30} - Mycoplasma pulmonis 

(Q9JU23) Thioredoxin-reductase (EC 1.6.4.5) {GENE:TRXB OR 
NMA1538} - Neisseria meningitidis (serogroup A) 
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NO. 



260 



261 



262 



263 



264 



265 



266 



267 



268 



269 



270 



271 



272 



273 



274 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 



(Q9JZ28) Thioredoxin-reductase {GENE:NMB1 324} - Neisseria 
m eningitidis (serogroup B) 
(Q9I0M2) Thioredoxin-reductase 1 {GENE:TRXB1 ORPA2616} 
Pseudomonas aeruginosa 



(Q9I592) Thioredoxin-reductase 2 {GENE:TRXB2 OR PA0849} 
Pseudomon as aeruginosa 

(Q9V0Q8) THioredoxin-reductase (TRXB) {GENE:TRXB OR 
PAB0500} - Pyrococcus abyssi • 



(Q9ZD33) THioredoxin-reductase (TRXB2) {GENE:RP514} - 
Rickettsia prowazekii _ 



(O 54079) Thioredoxin-reductase (EC 1.6.4.5) {GENE:TRXB} - 
Staphylococcus aureus 



(Q9RIS2) Thioredoxin-reductase {GENE:TRXB OR TRXB2} - 
Streptomyces coelicolor 

(Q9K4L6) Thioredoxin-reductase {GENE:SC5F8.08C} - 
Streptomyces coelicolor 



(Q97PY2) Thioredoxin-reductase {GENE:SP1 458} - 
Strepto coccus pneumoniae 

(Q9A0B5) Thioredoxin-reductase {GENE:SPY0850} - 
Streptococcus pyogenes m 



(Q97V69) Thioredoxin-reductase (trxB-2) (EC 1.6.4.5) 
{GENE:TRXB-2} - Sulfolobus solfataricus 

(Q97W27) Thioredoxin-reductase (trxB-3) (EC 1.6.4.5) 
(GENE: TRXB-3} - Sulfoiobus solfataricus 

(Q97WJ5) Thioredoxin-reductase (trxB-1) (EC 1.6.4.5) 
{GENE:TRXB-1 } - Sulfolobus solfataricus 

(Q98I59) Thioredoxin-reductase {GENE:MLL2552} -Rhizobium 
loti (Mesorhizobium loti) . 



(Q98M06) Thioredoxin-reductase {GENE:MLL0792} - 
Rhizobium loti (Mesorhizobium loti) 
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SEQ. ID 
NO. 



275 



276 



277 



278 



279 



280 



281 



282 



10 



283 



284 



SWISS PROTEIN IDENTIFIER 

(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 



(Q9UR80) 35 kDa THioredoxin-reductase HOMOLOG 
(FRAGMENT) {GENE:TRR1 AND YDR353W} - Saccharomyces 
cer evisiae (Baker's yeast) 

(Q9ZEH4) THIOREDOXIN {GENE'.TRXA OR SA0992} - 
Staphylococcus aureus, Staphylococcus aureus subsp. aureus 

N315 

(Q9S1H1) Thioredoxin-reductase (Fragment) {GENE-.TRXB} - 
Staphylococcus xylosus 



(Q9HJI4) Thioredoxin-reductase {GENE:TA0984} - 
Thermo plasma acidophilum 

(Q9WZX3) THioredoxin-reductase {GENE:TM0869} - 
Thermotoga maritima 



(Q979K8) Thioredoxin-reductase {GENE:TVG1 183005} - 
Thermoplasma volcanium . 



(Q9PR71) Thioredoxin-reductase {GENE:TRXB OR UU074} - 
Ureaplasma parvum (Ureaplasma ureaiyticum biotype 1) 



(Q9KSS4) Thioredoxin-reductase {GENE:VC1 1 82} - Vibrio 
cholerae 

(Q9PDD1) Thioredoxin-reductase {GENE:XF1 448} - Xylella 
fastidiosa 

(09X5F7) Thioredoxin-reductase {GENE:TRXB1 } - Zymomonas 
mobilis 





/AIM 11VI AAl_ I 1 I 

285 1 


(Q9GKW9) Thioredoxin-reductase 3 (Fragment) {GENE:TRXR3} 
- Bos taurus (Bovine) 




\ 286 


(Q9N2I8) Thioredoxin-reductase (EC 1 .6.4.5) - Bos taurus 
(Bovine) . 




I 287 


(Q9N2K1) Thioredoxin-reductase homolog - Caenorhabditis 
elegans — 


15 


| 288 


(Q9NJH3) Thioredoxin-reductase - Caenorhabditis elegans || 
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SEQ. ID 
NO. 



294 



295 



296 



10 



297 



298 



299 



300 



15 



301 



302 



303 



SWISS PROTEIN IDENTIFIER 
(in parenthesis) 



EXAMPLES OF REDOX PROTEINS 



289 I (Q9VNT5) CG11401 PROTEIN (THioredoxin-reductase 2) 
{GENE:TRXR-2 OR CG1 1401} - Drosophila melanogaster. (Fruit 

fly) : , 

290 [(O95840) Thioredoxin-reductase - Homo sapiens (Human) 

291 | (Q9UES8) Thioredoxin-reductase GRIM-12 - Homo sapiens 
(Human) 

292 I (Q9UH79) Thioredoxin-reductase {GENE:TR} - Homo sapiens 
(Human) 



293 (Q9UQU8) Thioredoxin-reductase - Homo sapiens (Human) 



(Q9NNW6) Thioredoxin-reductase TR2 (Fragment) - Homo 
sapiens (Human) __ 



(Q9NNW7) Thioredoxin-reductase TR3 - Homo sapiens 
(Human) 



(Q9P101) Thioredoxin-reductase 3 (Fragment) {GENE:TRXR3} 
- Homo sapiens (Human) 



(Q9P2Y0) Thioredoxin-reductase II beta (EC 1 .6.4.5) - Homo 
sapiens (Human) 

(Q9H2Z5) Mitochondrial thioredoxin-reductase 
(GENE:TRXR2A) - Homo sapiens (Human) 



(Q99475) KM-1 02-DERIVED REDUCTASE-LIKE FACTOR 
(THiore doxin-reductase) - Homo sapiens (Human) 

(Q99P49) Thioredoxin-reductase 1 {GENE:TXNRD1} - Mus 
musculus (Mouse) 



(Q9CSV5) Thioredoxin-reductase 1 (Fragment) 
{GENE:TXNRD1 } - Mus musculus (Mouse) 

(Q9CZE5) Thioredoxin-reductase 1 {GENE:TXNRD1 } - Mus 
mus culus (Mouse) 

(Q9JHA7) Thioredoxin-reductase TR3 {GENE:TXNRD2 OR 
TR3} - Mus musculus (Mouse) 



304 



(Q9JLT4) Thioredoxin-reductase {GENE:TXNRD2 OR TRXR2} 
Mus musculus (Mouse) 
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, SEQ. ID 
NO. 


SWISS PROTEIN IDENTIFIER 
(in parenthesis) 


EXAMPLES OF REDOX PROTEINS 


305 


(Q9JMH5) Thioredoxin-reductase 2 {GENE:TXNRD2 OR 
TYMPnol _ iv/iiic miiRrnlus (Mouse) 


306 


(Q9JMH6) Thioredoxin-reductase 1 {GENE:TXNRD1 OR 
TXNRD1} - Mus musculus (Mouse) — — 


307 


(089049) Thioredoxin-reductase - Rattus norvegicus (Rat) 


| 308 


(Q9JKZ3) Thioredoxin-reductase 1 (Fragment) - Rattus 
norvegicus (Rat) 


I 309 


(09JKZ4) Thioredoxin-reductase 1 - Rattus norvegicus (Rat) 


310 


(Q9JLE6) Thioredoxin-reductase (Fragment) - Rattus 
norvegicus (Rat) 


I 311 


(09R1I3) NADPH-dependent thioredoxin-reductase 
{GENE:TRR1 } - Rattus norvegicus (Rat) 


312 


(Q9Z0J5) Thioredoxin-reductase precursor {GENE:TRXR2} - 
Rattus norvegicus (Rat) 


313 


(Q9MYY8) Redox enzyme thioredoxin-reductase - Sus scrofa 
(Pig) 
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WHAT IS CLAIMED IS: 

1 . A method of producing an oil body associated with a recombinant 

multimeric-protein-complex, said method comprising: 

(a) producing in a cell comprising oil bodies, a first recombinant polypeptide and 
5 a second recombinant polypeptide wherein said first recombinant polypeptide is 

capable of associating with said second recombinant polypeptide to form said 
multimeric-protein-complex; and 

(b) associating said multimeric-protein-complex with an oil body through an oil- 
body-targeting-protein capable of associating with said oil body and said first 

10 recombinant polypeptide. 

2. The method of claim 1 further comprising (c) isolating said oil 
bodies associated with said recombinant multimeric-protein-complex. 

3. The method of claim 1 wherein said multimeric-protein-complex 
associates with oil bodies obtainable from said cell comprising oil bodies. 

15 4. The method of claim 1 wherein said multimeric-protein-complex 

associates intracellular^ with said oil bodies. 

5. The method of claim 1 wherein said second recombinant 

polypeptide is associated with a second oil-body-targeting-protein capable of 

associating with an oil body and said second recombinant polypeptide. 
20 6. The method of claim 6 wherein each of said oil-body-targeting- 

proteins is an oil-body-protein or an immunoglobulin. 

7. The method of claim 6 wherein said oil-body-targeting-protein is an 

oleosin or caleosin. 

8. The method of claim 1 wherein said oil-body-targeting-protein is an 
25 oleosin or caleosin and said first recombinant polypeptide is fused to said oleosin 

or caleosin. 

9. The method of claim 8 wherein said second recombinant 
polypeptide is fused to a second oleosin or second caleosin capable of 

associating with an oil body. 
30 10 . The method of claim 1 wherein said first and second recombinant 

polypeptides are produced as a multimeric-fusion-protein comprising said first 
and second recombinant polypeptide. 
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1 1 . The method of claim 1, wherein said multimeric-protein-complex is a 
heteromultimeric-protein-complex. 

12. The method of claim 11 wherein said heteromultimeric-protein- 
complex is an enzymatically active redox complex or an immunoglobulin. 
5 13. The method of claim 1, wherein said first recombinant polypeptide is 

capable of associating with said second recombinant polypeptide in the cell. 

14. The method of claim 1 wherein said first recombinant polypeptide is 
a thioredoxin and said second recombinant polypeptide is a thioredoxin- 
reductase. 

10 15# Tne method of claim 14, wherein said thioredoxin is selected from 

the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID N0s:52-194. 

16. The method of claim 14, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 

<15 17. The method of claim 1 wherein said cell is a plant cell. 

18. The method of claim 1 wherein said cell is a safflower cell. 

19. The method of claim 1 wherein said first recombinant polypeptide is 
an immunoglobulin-polypeptide-chain. 

20. The method of claim 1 wherein said first recombinant polypeptide is 
20 an immunoglobulin light chain, or an immunologically active portion thereof, and 

said second recombinant polypeptide is an immunoglobulin heavy chain, or an 
immunologically active portion thereof. 

21. The method of claim 19 wherein said oil-body targeting-protein 

comprises protein A, protein L or protein G. 
25 22. A method of expressing a recombinant multimeric-protein-complex 

comprising a first and second recombinant polypeptide in a ceil, said method 
comprising: 

(a) introducing into a cell a first chimeric nucleic acid sequence comprising: 

(i) a first nucleic acid sequence capable of regulating transcription 

30 in said cell operatively linked to; 

(ii) a second nucleic acid sequence encoding a first recombinant 

polypeptide; 
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(b) introducing into said cell a second chimeric nucleic acid sequence comprising: 

(i) a third nucleic acid sequence capable of regulating transcription 

in said cell operatively linked to; 

(ii) a fourth nucleic acid sequence encoding a second recombinant 

5 polypeptide; 

(c) growing said cell under conditions to permit expression of said first and 
second recombinant polypeptide in a progeny cell comprising oil bodies wherein 
said first recombinant polypeptide and said second recombinant polypeptide are 
capable of forming a multimeric-protein-complex; and 

10 (d) associating said first recombinant polypeptide with an oil body through an oil- 
body-targeting-protein capable of associating with said oil body and said first 

recombinant polypeptide. 

23. The method of claim 22 further comprising (e) isolating from said 
progeny cell, oil bodies comprising said multimeric-protein-complex. 
15 24. The method of claim 22 wherein said multimeric-protein-complex 

associates with said oil bodies obtainable from said progeny cell comprising oil 
bodies. 

25. The method of claim 22 wherein said oil bodies associate 
intracellular^ with said multimeric-protein-complex. 
20 26. The method of claim 22 wherein said second recombinant 

polypeptide is associated with a second oil-body-targeting-protein capable of 
associating with an oil body and said second recombinant polypeptide. 

27. The method of claim 26 wherein each of said oil-body-targeting- . 
proteins is selected from an oil-body-protein or an immunoglobulin. 
25 28. The method of claim 27 wherein- said oii-body-protein is an oleosin or 

caleosin. 

29. The method of claim 28 wherein said first recombinant polypeptide is 
fused to said oleosin or caleosin. 

30. The method of claim 29 wherein said second recombinant 
30 polypeptide is fused to a second oleosin or second caleosin capable of 

associating with an oil body. 
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31 . The method of claim 22 wherein said first and second recombinant 
polypeptide are produced as a multimeric-fusion-protein comprising said first and 

second recombinant polypeptide. 

32. The method of claim 22 wherein said first and second recombinant 
5 polypeptide form a multimeric-protein-complex. 

33. The method of claim 32, wherein said multimeric-protein-complex is 
a heteromultimeric-protein-compiex. 

34. The method of claim 32 wherein said heteromultimeric-protein- 
complex is an enzymatically active redox complex or an immunoglobulin. 

<I0 35. The method of claim 22 wherein said first recombinant polypeptide 

and said second recombinant polypeptide are capable of forming a multimeric- 
protein-complex in said progeny cell. 

36. The method of claim 22 wherein said first recombinant polypeptide is 
a thioredoxin and said second recombinant polypeptide is a thioredoxin- 

15 reductase. 

37. The method of claim 36, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

38. The method of claim 36, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 

20 40, 44, 48, 50 and SEQ ID NOs:195-313. 

39. The method of claim 22 wherein said first recombinant polypeptide is 
an immunoglobulin-polypeptide-chain. 

40. The method of claim 22 wherein said first recombinant polypeptide is 
an immunoglobulin light chain, or an immunologically active portion thereof, and 

25 said second recombinant polypeptide is an immunoglobulin heavy chain, or an 
Immunologically active portion thereof. 

41. The method of claim 39 wherein said oil-body targeting-protein 
comprises protein A, protein L or protein Q. 

30 42. The method of claim 22 wherein said cell is a plant cell. 

43. The method of claim 42 wherein said plant cell is a safflower cell. 
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44. A method of producing in a plant a recombinant multimeric-protein- 
complex, said method comprising: 

(a) preparing a first plant comprising cells, said cells comprising oil bodies and a 
first recombinant polypeptide wherein said first recombinant polypeptide is 

5 capable of associating with said oil bodies through an oil-body-targeting-protein; 

(b) preparing a second plant comprising cells, said cells comprising oil bodies and 
a second recombinant polypeptide; and 

(c) sexually crossing said first piant with said second plant to produce a progeny 
plant comprising cells, said cells comprising oil bodies, wherein said oil bodies 

1 0 are capable of associating with said first recombinant polypeptide, and said first 
recombinant recombinant polypeptide is capable of associating with said second 
recombinant polypeptide to form said recombinant multimeric-protein-complex. 

45. The method of claim 44 wherein said second recombinant 
polypeptide is capable of associating with oil bodies through an oil-body- 

15 targeting-protein in said second plant. 

46. The method of claim 44 further comprising (d) isolating from said 
progeny plant oil bodies comprising said multimeric-protein-complex. 

47. The method of claim 44 wherein said oil-body-targeting-protein is 
selected from an oil-body-protein or an immunoglobulin. 

20 48. The method of claim 47 wherein said oil-body-protein is an oleosin or 

caleosin. 

49. The method of claim 48 wherein said first recombinant polypeptide is 

fused to said oleosin or caleosin. 

50. The method of claim 49 wherein said second recombinant 
25 polypeptide is fused to a second oleosin or second caleosin capable of 

associating with an oil body. 

51. The method of claim 44 wherein said first and second recombinant 

polypeptide form a multimeric-protein-complex. 

52. The method of claim 51, wherein said multimeric-protein-complex is 

30 a heteromultimeric-protein-complex. 

53. The method of claim 52 wherein said heteromultimeric-protein- 
complex is an enzymatically active redox complex or an immunoglobulin. 
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54. The method of claim 44 wherein said first recombinant polypeptide is 
a thioredoxin and said second recombinant polypeptide is a thioredoxin- 
reductase. 

55. The method of claim 54, wherein said thioredoxin is selected from 
5 the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

56. The method of claim 54, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 

57. The method of claim 44 wherein said first recombinant polypeptide is 
10 an immunoglobulin-polypeptide-chain. 

58., The method of claim 44 wherein said first recombinant polypeptide is 
an immunoglobulin light chain, or an immunologically active portion thereof, and 
said second recombinant polypeptide is an immunoglobulin heavy chain, or an 
immunologically active portion thereof. 
15 59. The method of claim 57 wherein said oil-body targeting-protein 

comprises protein A, protein L or protein G. 

60. The method of claim 44 wherein said plant is safflower. 

61. A chimeric nucleic acid sequence encoding a multimeric-fusion- 
protein, said nucleic acid comprising: 

20 (a) a first nucleic acid sequence encoding an oil-body-targeting-protein 
operatively linked in reading frame to; 

(b) a second nucleic acid sequence encoding a first recombinant polypeptide; 
linked in reading frame to; 

(c) a third nucleic acid sequence encoding a second recombinant polypeptide, 
25 wherein said first and second recombinant polypeptide are capable of forming a 

multimeric-protein-complex. 

62. The nucleic acid of claim 61, wherein said oil-body-targeting-protein 
is selected from an oil-body-protein or an immunoglobulin. 

63! The nucleic acid of claim 62, wherein said oil-body-protein is an 

30 oleosin or caleosin. 

64. The nucleic acid of claim 63, wherein said multimeric-protein- 
complex is a heteromultimeric-protein-complex. 



WO 02/50289 



PCTAJS01/50240 



-130- 



65. The chimeric nucleic acid sequence of claim 61 wherein said first 
and second recombinant polypeptide form an enzymatically active 
heteromultimeric redox complex or an immunoglobulin. 

66. The chimeric nucleic acid sequence of claim 65 wherein said first 
5 and second recombinant polypeptides are a thioredoxin and a thioredoxin- 

reductase. 

67. The chimeric nucleic acid of claim 66, wherein said thioredoxin is 
selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 
NOs:52-194. 

10 68. The chimeric nucleic acid of claim 66, wherein said thioredoxin- 

reductase is selected from the group consisting of those set forth in SEQ ID 
NOs:8, 9, 1 0, 40, 44, 48, 50 and SEQ ID NOs:1 95-31 3. 

69. The chimeric nucleic acid of claim 65 wherein said first recombinant 
polypeptide is an immunoglobulin-polypeptide-chain. 

15 70. The chimeric nucleic acid of claim 66 wherein said first recombinant 

polypeptide is an immunoglobulin light chain, or an immunologically active 
portion thereof, and said second recombinant polypeptide is an immunoglobulin 
heavy chain, or an immunologically active portion thereof. 

71. The chimeric nucleic acid of claim 69 wherein said oil-body targeting- 
20 protein comprises protein A, protein L or protein Q. 

72. The nucleic acid of claim 61, wherein positioned between said 
nucleic acid sequence encoding an oil-body-targeting-protein and said nucleic 
acid sequence encoding a first recombinant polypeptide is a linker nucleic acid 
sequence encoding an oil-body-surface-avoiding linker amino acid sequence. 

25 73. The nucleic acid of claim 72, wherein said oil-body-surface-avoiding 

linker amino acid sequence is substantially negatively charged, or has a 
molecular weight of at least 35 kd. 

74. The nucleic acid of claim 73, wherein the gene fusion further 
comprises a linker nucleic acid sequence encoding an amino acid sequence that 
30 is specifically cleavable by an enzyme or a chemical, wherein the linker sequence 
is positioned between the oil-body-surface-avoiding linker amino acid sequence 
and said sequence encoding the first recombinant polypeptide. 
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75. A recombinant multimeric-fusion-protein comprising (i) an oil-body- 
targeting-protein, or fragment thereof, (ii) a first recombinant polypeptide and a 
(iii) second recombinant polypeptide, wherein said first and second recombinant 
polypeptides are capable of forming a multimeric-protein-complex. 
5 76. The recombinant multimeric-fusion-protein of claim 75 wherein said 

oil-body-targeting-protein is selected from an oil-body-protein or an 
immunoglobulin. 

77. The recombinant multimeric-fusion-protein of claim 76 wherein said 

oil-body-protein is an oleosin or a caleosin. 
10 78. The recombinant multimeric-fusion-protein of claim 77, wherein said 

multimeric-fusion-protein is a heteromultimeric-fusion-protein. 

79. The recombinant heteromultimeric-fusion-protein of claim 78 wherein 

said first and second recombinant polypeptide form an enzymatically active 

heteromultlmeric redox complex or an immunoglobulin. 
15 80. The recombinant fusion polypeptide of claim 79 wherein said first 

recombinant polypeptide is a thioredoxin and said second recombinant 

polypeptide is a thioredoxin-reductase. 

81. The recombinant fusion polypeptide of claim 80, wherein said 
thioredoxin is selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 

20 and SEQ ID NOs:62-1 94. 

82. The recombinant fusion polypeptide of claim 80, wherein said 
thioredoxin-reductase is selected from the group consisting of those set forth in 
SEQ ID NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID NOs:195-313. 

83. The recombinant fusion polypeptide of claim 75, wherein positioned 
25 between said oil-body-targeting-protein and said first recombinant polypeptide is 

an oil-body-surface-avoiding linker amino acid sequence. 

84. The recombinant fusion polypeptide of claim 83, wherein said oil- 
body-surface-avoiding linker amino acid sequence is substantially negatively 
charged, or has a molecular weight of at least 35 kd. 

30 85. The recombinant fusion polypeptide of claim 84, wherein the fusion 

polypeptide further comprises a linker amino acid sequence that is specifically 
cleavable by an enzyme or a chemical, wherein the linker sequence is positioned 
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between the oil-body-surface-avoiding linker amino acid sequence and said first 

recombinant polypeptide. 

86. Isolated oil bodies comprising a multimeric-protein-complex 
comprising (i) an oil-body-targeting-protein and (ii) a first recombinant 

5 polypeptide, said oil bodies further comprising a second recombinant 

polypeptide, wherein said first and second recombinant polypeptide are capable 
of forming a multimeric-protein-complex. 

87. Isolated oil bodies of claim 86 wherein said oil-body-targeting-protein 
is an oil-body-protein or an immunoglobulin. 

10 88. Isolated oil bodies of claim 87 wherein said oil-body-protein is an 

oleosin or a caleosin. 

89. Isolated oil bodies of claim 88 wherein said first recombinant 

polypeptide is fused to said oleosin or caleosin. 

90. Isolated oil bodies of claim 86 wherein said first recombinant 
15 polypeptide is fused to said second recombinant polypeptide. 

91. The isolated oil bodies of claim 90, wherein said multimeric-protein- 
complex is a heteromultimeric-protein-complex. 

92. The isolated oil bodies of claim 91 wherein said heteromultimeric- 
protein-complex is an enzymatically active redox complex or an immunoglobulin. 

20 93. Isolated oil bodies comprising 

(a) a first fusion protein comprising a first oil-body-targeting-protein fused to a 
first recombinant polypeptide; and 

(b) a second fusion protein comprising a second oil-body-targeting-protein fused 
to a second recombinant polypeptide, 

25 wherein said first and second recombinant polypeptide are capable of forming a 
multimeric-protein-complex. 

94. Isolated oil bodies of claim 93 wherein said first oil-body-targeting- 
protein is an oil-body-protein or an immunoglobulin. 

95. Isolated oil bodies according claim 93 wherein said first oil-body- 

30 protein is an oleosin or a caleosin. 

96. The isolated oil bodies of claim 93, wherein said multimeric-protein- 
complex is a heteromultimeric-protein-complex. 



WO 02/50289 



PCT/IIS01/50240 



-133- 



97. Isolated oil bodies of claim 93 wherein said first and second 
recombinant polypeptide form an enzymatically active heteromultimeric redox 
complex or an immunoglobulin. 

98. Isolated oil bodies of claim 93 wherein said first recombinant 
5 polypeptide is a thioredoxin and said second recombinant polypeptide is a 

thioredoxin-reductase. 

99. The oil bodies of claim 98, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

100. The oil bodies of claim 98, wherein said thioredoxin-reductase is 
10 selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 

40, 44, 48, 50 and SEQ ID NOs:195-313. 

101 . The oil bodies of claim 93 wherein said first recombinant 
polypeptide is an immunoglobulin-polypeptide-chain. 

102. The oil bodies of claim 93 wherein said first recombinant 

15 polypeptide is an immunoglobulin light chain, or an immunologically active 

portion thereof, and said second recombinant polypeptide is an immunoglobulin 
heavy chain, or an immunologically active portion thereof. 

103. The oil bodies of claim 101 wherein said oil-body targeting-protein 
comprises protein A, protein L or protein G. 

20 104. A cell comprising oil bodies and (i) an oil-body-targeting-protein, (ii) 

a first recombinant polypeptide and (iii) a second recombinant polypeptide 
wherein 

(1 ) said first recombinant polypeptide is capable of associating with said oil- 
body-targeting-protein; and 
25 (2) said first recombinant polypeptide capable of associating with said second 
recombinant polypeptide to form a multimeric-protein-complex. 

105. The cell of claim 104 wherein said oil-body-targeting-protein is an 

oil-body-protein or an immunoglobulin. 

106. The cell of claim 105 wherein said oil-body-protein is an oleosin or 

30 caleosin. 
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107. The cell of claim 104 wherein said first recombinant polypeptide is 
fused to said second recombinant polypeptide so as to form a multimeric-fusion- 
protein. 

108. The cell of claim 107 wherein said multimeric-fusion-protein is a 
5 heteromultimeric-fusion-protein. 

109. The cell of claim 104 wherein said first recombinant polypeptide is 

fused to said oil-body-targeting-protein. 

110. The cell of claim 104 wherein said first recombinant polypeptide is 
fused to said first oil-body-targeting-protein and said second polypeptide is fused 

10 to a second oil-body-targeting-protein. 

111. The cell of claim 1 04 wherein said second recombinant polypeptide 
is capable of associating with a second oil-body-targeting-protein. 

1 1 2. The cell of claim 104 wherein said first and second recombinant 
polypeptide form a heteromultimeric-protein-complex. 
15 1 13. The cell of claim 104 wherein said heteromultimeric-protein- 

complex is an enzymatically active redox complex or an immunoglobulin. 

114. The cell of claim 104 wherein said first polypeptide is a thioredoxin 
and said second polypeptide is a thioredoxin-reductase. 

115. The cell of claim 114, wherein said thioredoxin is selected from the 
20 group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-1 94. 

1 1 6. The cell of claim 1 1 4, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 

117. The cell of claim 104 wherein said first recombinant polypeptide is 
25 an immunoglobulin-polypeptide-chain. 

118. The cell of claim 104 wherein said first recombinant polypeptide is 
an immunoglobulin light chain, or an immunologically active portion thereof, and 
said second recombinant polypeptide is an immunoglobulin heavy chain, or an 
immunologically active portion thereof. 

30 119. The cell of claim 1 1 7 wherein said oil-body targeting-protein 

comprises protein A, protein L or protein G. 

120. The cell of claim 104 wherein said cell is obtained from a plant. 
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1 21 . The cell of claim 104 wherein said cell is obtainable from a 

safflower plant. 

122. A plant comprising cells of claim 104. 

123. A safflower plant comprising cells of claim 104. 

5 1 24. The method of claim 2 wherein said first recombinant polypeptide is 

a thioredoxin and said second recombinant polypeptide is a thioredoxin- 
reductase, said method further comprising (d) formulating the oil bodies for use 
in the preparation of a food product, personal care product or pharmaceutical 
composition. 

10 1 25. The method of claim 1 24, wherein said thioredoxin is selected from 

the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-1 94. 

1 26. The method of claim 1 24, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 

15 127. The method of claim 124 wherein said formulating comprises the 

addition of NADP or NADPH. 

128. The method of claim 124 wherein said food product is a milk or 

wheat based food product. 

1 29. The method of claim 124 wherein said personal care product 
20 reduces the oxidative stress to the surface area of the human body or is used to 
lighten the skin. 

130. The method of claim 124 wherein said pharmaceutical composition 
is used to treat chronic obstructive pulmonary disease (COPD), cataracts, 
diabetes, envenomation, bronchiopulmonary disease, malignancies, psoriasis, 

25 reperfusion injury, wound healing, sepsis, Gl bleeding, intestinal bowel disease 
(IBD), ulcers, GERD (gastro esophageal reflux disease). 

131. A composition comprising isolated oil bodies, thioredoxin and 

thioredoxin-reductase. 

132. The composition of claim 131, wherein said thioredoxin is selected 
30 from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52- 

194. 
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133. The composition of claim 131, wherein said thioredoxin-reductase 
is selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 

134. The composition of claim 131 further comprising NADP or NADPH. 
5 135. A food product, personal care product or pharmaceutical 

composition comprising the composition of claim 131. 

136. The food product, personal care product or pharmaceutical 
composition of claim 135 further comprising NADP or NADPH. 

137. The food product of claim 135 wherein said food product is a milk 

10 based or wheat based food product. 

138. The personal care product of claim 135 wherein said personal care 
product reduces the oxidative stress to the surface area of the human body or is 

used to lighten the skin. 

139. The pharmaceutical composition of claim 135 wherein said 
15 pharmaceutical composition is used to treat chronic obstructive pulmonary 

disease, cataracts, psoriasis or reperfusion injury. 

140. The multimeric-fusion-protein of claim 75, wherein said fusion- 
protein contains two or more polypeptide chains selected from the group of 
proteins set forth in Figure 5. 

20 141 . A method of reducing allergenicity of a food comprising the steps 

of: 

providing the isolated oil bodies of claim 78; and 

adding the isolated oil bodies to the food, whereby allergenicity of 

the food is reduced. 

25 142. The method of claim 141, wherein the food is selected from the 

group consisting of wheat flour, wheat dough, milk, cheese, yogurt and ice 
cream. 

143. The method of claim 141, further comprising providing NADH as a 
co-factor in the substantial absence of NADPH. 
30 144. A method of treating or protecting a target against oxidative stress, 

comprising the steps of: 

providing the recombinant fusion polypeptide of claim 46; and 
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contacting the recombinant fusion polypeptide with a target, 
wherein the target is susceptible to oxidative stress, thereby treating or 
protecting against the stress. 

145. The method of claim 144, wherein the target is selected from the 
5 group consisting of a molecule, a molecular complex, a cell, a tissue, and an 

organ. 

146. A method for preparing an enzymatically active redox protein 
associated with oil bodies comprising: 

a) producing in a cell a redox fusion polypeptide comprising a first 
10 redox protein linked to a second redox protein; 

b) associating said redox fusion polypeptide with oil bodies through 
an oil-body-targeting-protein capable of associating with said redox fusion 
polypeptide and said oil bodies; and 

c) isolating said oil bodies associated with said redox fusion 

15 polypeptide. 

147. The method of claim 146 wherein said oil-body-targeting-protein is 
an oil-body-protein or an immunoglobulin. 

148. The method of claim 146 wherein said oil-body-protein is an oleosin 

or a caleosin. 

20 149. The method of claim 146 wherein said first redox protein is a 

thioredoxin and said second redox protein is a thioredoxin-reductase. 

150. The method of claim 149, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

151. The method of claim 149, wherein said thioredoxin-reductase is 
25 selected from the group consisting of those set forth in SEQ ID N0s:8, 9, 10, 

40, 44, 48, 50 and SEQ ID NOs:195-313. 

1 52. The method of claim 146 wherein said cell is a plant cell. 

153. The method of claim 146 wherein said ceil is a safflower cell. 

1 54. A method for preparing a redox protein associated with oil bodies 

30 comprising: 

a) introducing into a cell a chimeric nucleic acid sequence comprising: 
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1 ) a first nucleic acid sequence capable of regulating 
transcription in said cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant 
fusion polypeptide comprising (i) a nucleic acid sequence encoding 

5 a sufficient portion of an oil-body-protein to provide targeting of 

said recombinant fusion polypeptide to an oil body linked to (ii) a 
nucleic acid sequence encoding a redox fusion polypeptide 
comprising a first redox protein linked to a second redox protein 
operatively linked to; 
1 0 3) a third nucleic acid sequence capable of terminating 

transcription in said cell; 

b) growing said cell under conditions to permit expression of said 
redox fusion polypeptide in a progeny cell comprising oil bodies; and 

c) isolating from said progeny cell said oil bodies comprising said 

15 redox fusion polypeptide. 

155. The method of claim 154, wherein positioned between said nucleic 
acid sequence encoding a sufficient portion of an oil-body-protein and said 
nucleic acid sequence encoding a redox fusion polypeptide is a linker nucleic 
acid sequence encoding an oil-body-surface-avoiding linker amino acid sequence. 

20 156. The method of claim 155, wherein said oil-body-surface~avoiding 

linker amino acid sequence is substantially negatively charged, or has a 
molecular weight of at least 35 kd. 

157. The method of claim 156, wherein the gene fusion further 
comprises a linker nucleic acid sequence encoding an amino acid sequence that 

25 is specifically cleavable by an enzyme or a chemical, wherein the linker sequence 
is positioned between the oil-body-surface-avoiding linker amino acid sequence 
and said nucleic acid sequence encoding a redox fusion polypeptide. 

158. The method of claim 157, further comprising introducing an 
enzyme or chemical that cleaves said redox fusion polypeptide from said oil 

30 body, thereby obtaining isolated redox fusion polypeptide. 

159. The method of claim 154 wherein said oil-body-protein is an oleosin 

or a caleosin. 
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1 60. The method of claim 1 54 wherein said first redox protein is a 
thioredoxin and said second redox protein is a thioredoxin-reductase. 

161 . The method of claim 160, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 
5 1 62. The method of claim 1 60, wherein said thioredoxin-reductase is 

selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 

1 63. The method of claim 1 54 wherein said ceil is a plant cell. 

164. The method of claim 154 wherein said thioredoxin and thioredoxin- 
10 reductase is obtained from Arabidopsis. 

165. The method of claim 146 wherein the first redox protein is at least 
5 times more active when produced as a redox fusion polypeptide as compared 
to the production of the first redox protein without the second redox protein. 

166. The method of claim 146 further comprising: 

15 d) formulating an emulsion of the oil bodies associated with the redox 

fusion polypeptide for use in the preparation of a product capable of treating 
oxidative stress in a target, a product capable of chemically reducing a target, 
pharmaceutical composition, a personal care product or a food product. 

167. A chimeric nucleic acid comprising: 

20 1) a first nucleic acid sequence capable of regulating transcription in 

a host cell operatively linked to; 

2) a second nucleic acid sequence encoding a recombinant fusion 
polypeptide comprising (i) a nucleic acid sequence encoding a sufficient portion 
of an oil-body-protein to provide targeting of said recombinant fusion polypeptide 

25 to an oil body linked to (ii) a nucleic acid sequence encoding a redox fusion 
polypeptide comprising a first redox protein linked to a second redox protein 

operatively linked to; 

3) a third nucleic acid sequence capable of terminating transcription 

in said cell. 

30 168. The chimeric nucleic acid of claim 167 wherein said oil-body-protein 

is an oleosin or a caleosin. 
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169. The chimeric nucleic acid of claim 167 wherein said first redox 
protein is a thioredoxin and said second redox protein is a thioredoxin-reductase. 

170. The chimeric nucleic acid of claim 169, wherein said thioredoxin is 
selected from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID 

5 NOs:52-194. 

171. The chimeric nucleic acid of claim 169, wherein said thioredoxin- 
reductase is selected from the group consisting of those sot forth in SEQ ID 
NOs:8, 9, 10, 40, 44, 48, 50 and SEQ ID NOs:195-313. 

172. The chimeric nucleic acid of claim 167 wherein said cell is a plant 

10 cell. 

173. The chimeric nucleic acid of claim 167, wherein positioned between 
said nucleic acid sequence encoding a sufficient portion of an oil-body-protein 
and said nucleic acid sequence encoding a redox fusion polypeptide is a linker 
nucleic acid sequence encoding an oil-body-surface-avoiding linker amino acid 

16 sequence. 

174. The chimeric nucleic acid of claim 173, wherein said oil-body- 
surface-avoiding linker amino acid sequence is substantially negatively charged, 
or has a molecular weight of at least 35 kd. 

175. The chimeric nucleic acid of claim 174, wherein the gene fusion 
20 further comprises a linker nucleic acid sequence encoding an amino acid 

sequence that is specifically cleavabie by an enzyme or a chemical, wherein the 
linker sequence is positioned between the oil-body-surface-avoiding linker amino 
acid sequence and said nucleic acid sequence encoding a redox fusion 
polypeptide. 

25 176. A transgenenic plant comprising the chimeric nucleic acid sequence 

of claim 167. 

177. The transgenic plant of claim 176, wherein said chimeric nucleic 

acid is contained within a piastid. 

178. A saffiower plant comprising the chimeric nucleic acid of anyone of 

30 claim 1 67. 

179. The saffiower plant of claim 178, wherein said chimeric nucleic 
acid is contained within a piastid. 
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180. A plant seed comprising the chimeric nucleic acid of claim 167. 

181. The plant seed of claim 180 t wherein said chimeric nucleic acid is 

contained within a plastid. 

182. A safflower seed comprising the chimeric nucleic acid of claim 168. 
5 183. The safflower seed of claim 182, wherein said chimeric nucleic acid 

is contained within a plastid. 

184. An oil body preparation obtained by the method of claim 146. 

185. A food product comprising an oil body preparation of claim 184. 

186. A composition comprising an oil body preparation of claim 184. 

10 187. A personal care product comprising an oil body preparation of claim 

184. 

188. A product capable of treating oxidative stress in a target comprising 
an oil body preparation of claim 184. 

189. A product capable of chemically reducing a target comprising an oil 

15 body preparation of claim 184. 

190. A detergent composition comprising the product of claim 184. 

191. A method of cleansing an item, comprising administering the 
product of claim 189 to said item under conditions that promote cleansing. 

192. An emulsion formulation prepared by the method of claim 166, 
20 193. A nucleic acid construct comprising a gene fusion, wherein the 

gene fusion comprises a first region encoding an oil-body-protein or an active 
fragment thereof, operably linked to a second region encoding at least one 
thioredoxin-related protein or an active fragment thereof. 

1 94. The construct of claim 193, wherein the at least one thioredoxin- 

25 related protein is thioredoxin. 

195. The construct of claim 194, wherein said thioredoxin is selected 
from the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52- 
194. 

196. The construct of claim 194, wherein the thioredoxin is derived from 

30 Arabidopsis or wheat. 

197. The construct of claim 193, wherein the at least one thioredoxin- 
related protein is thioredoxin-reductase. 
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198. The construct of claim 197, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:1 95-31 3. 

199. The construct of claim 197, wherein the thioredoxin-reductase is 

5 derived from Arabidopsis or wheat. 

200. The construct of claim 197, wherein the thioredoxin-reductase is an 

NADPH-dependent thioredoxin-reductase. 

201. The construct of claim 193, wherein the second region encodes a 
thioredoxin and thioredoxin-reductase. 

10 202. The construct of claim 201, wherein the thioredoxin and 

thioredoxin-reductase is obtained from Mycobacterium leprae. 

203. The construct of claim 201, wherein at least one thioredoxin-related 

protein Is an engineered fusion protein. 

204. The construct of claim 193, wherein the first region precedes, in a 

15 5 ' to 3' direction, the second region. 

205. The construct of claim 193, wherein the first region follows, in a 

5 ' to 3' direction, the second region. 

206. The construct of claim 193, wherein the gene fusion further 
comprises a third region encoding a second thioredoxin-related protein or an 

20. active fragment thereof, operably linked to the first region, or to the second 

region, or to both. 

207. The construct of claim 193, further comprising a seed-specific 

promoter operably linked to the gene fusion. 

208. The construct of claim 207, wherein the promoter is a phaseolin 

25 promoter. 

209. The construct of claim 193, wherein at least one thioredoxin- 
related protein is derived from a plant species selected from the group consisting 

of Arabidopsis and wheat. 

210. The construct of claim 193, wherein at least one thioreoxin-related 

30 protein is derived from E. coli. 
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21 1 . The construct of claim 1 93 further comprising a nucleic acid 
effective as a termination region in plant cells, operably linked to the gene 
fusion. 

212. The construct of claim 193, wherein the gene fusion further 

6 comprises a nucleic acid sequence encoding an oil-body-surface-avoiding linker 
amino acid sequence, wherein the linker amino acid sequence is positioned 
between the first region and the second region. 

213. The construct of claim 212, wherein said oil-body-surface-avoiding 
linker amino acid sequence is substantially negatively charged, or has a 

10 molecular weight of at least 35 kd. 

214. The construct of claim 213, wherein the gene fusion further 
comprises a linker nucleic acid sequence encoding an amino acid sequence that 
is specifically cleavable by an enzyme or a chemical, wherein the linker sequence 
is positioned between the oil-body-surface-avoiding linker amino acid sequence 

15 and the second region. 

215. The construct of claim 193, wherein a region of the gene fusion 
comprises a plurality of codons, each codon specifying a single amino acid, 
wherein at least one of the codons is modified from a naturally occurring codon 
within the region, 

20 216. The construct of claim 215, wherein the modified codon specifies 

the same amino acid as the naturally occurring codon, and wherein the modified 
codon is modified according to a codon preference of a plant. 

217. The construct of claim 215, wherein the modified codon specifies 
an amino acid that is different from the amino acid specified by the naturally 

25 occurring codon. 

218. A transgenic plant containing a nucleic acid construct comprising a 
gene fusion, wherein the gene fusion comprises a region encoding an oil-body- 
protein or an active fragment thereof, operably linked to a region encoding a first 
thioredoxin-related protein or an active fragment thereof. 

30 219. The plant of claim 218, wherein the thioredoxin-related protein is 

thioredoxin. 
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220. The plant of claim 219, wherein said thioredoxin is selected from 
the group consisting of SEQ ID NOs:38, 42, 46, 50 and SEQ ID NOs:52-194. 

221 . The plant of claim 219, wherein the thioredoxin is derived from 

Arabidopsis or wheat. 
5 222. The plant of claim 218, wherein the thioredoxin-related protein is 

thioredoxin-reductase. 

223. The plant of claim 222, wherein said thioredoxin-reductase is 
selected from the group consisting of those set forth in SEQ ID NOs:8, 9, 10, 
40, 44, 48, 50 and SEQ ID NOs:195-313. 
10 224. The plant of claim 222, wherein the thioredoxin-reductase is an 

NADPH-dependent thioredoxin-reductase. 

225. The plant of claim 218, wherein the construct is contained within a 

plastid. 

226. The plant of claim 218, wherein the first thioredoxin-related protein 
15 is thioredoxin and wherein said construct further comprises a region encoding a 

thioredoxin-reductase. 

227. The plant of claim 226, wherein the thioredoxin and thioredoxin- 
reductase is obtained from Mycobacterium leprae. 

228. The plant of claim 226, wherein the thioredoxin-related protein is 

20 an engineered fusion protein. 

229. The plant of claim 218, wherein the first region precedes, in a 5' to 

3' direction, the second region. 

230. The plant of claim 218, wherein the first region follows, in a 5' to 

3' direction, the second region. 
25 231 . The plant of claim 21 8, wherein the gene fusion further comprises 

a third region encoding a second thioredoxin-related protein or an active 
fragment thereof, operably linked to the first region, or to the second region, or 
to both. 

232. The plant of claim 218, further comprising a seed-specific promoter 

30 operably linked to the gene fusion. 

233. The plant of claim 232, wherein the promoter is a phaseolin 

promoter. 
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234. The plant of claim 218, wherein the thioredoxin-related protein is 
derived from a plant species selected from the group consisting of Arabidopsis 
and wheat. 

235. The plant of claim 218, wherein the thioredoxin-related protein is 

5 derived from £. co//\ 

236. The plant of claim 218 further comprising a nucleic acid effective 
as a termination region in plant cells, operably linked to the gene fusion. 

237. The plant of claim 218, wherein the gene fusion further comprises 
a nucleic acid sequence encoding an oil-body-surface-avoiding linker amino acid 

10 sequence, wherein the nucleic acid encoding the linker amino acid sequence is 
positioned between the region encoding an oil-body-protein and the region 
encoding a first thioredoxin-related protein. 

238. The plant of claim 237, wherein said oil-body-surface-avoiding 
linker amino acid sequence is substantially negatively charged, or has a 

15 molecular weight of at least 35 kd. 

239. The plant of claim 238, wherein the gene fusion further comprises 
a linker nucleic acid sequence encoding an amino acid sequence that is 
specifically cleavable by an enzyme or a chemical, wherein the linker sequence is 
positioned between the oil-body-surface-avoiding linker amino acid sequence and 

20 the region encoding a first thioredoxin-related protein. 

240. The plant of claim 218, wherein a region of the gene fusion 
comprises a plurality of codons, each codon specifying a single amino acid, 
wherein at least one of the codons is modified from a naturally occurring codon 
within the region. 

25 241 . The plant of claim 240, wherein the modified codon specifies the * 

same amino acid as the naturally occurring codon, and wherein the codon is 
modified according to a codon preference of a plant. 

242. The plant of claim 240, wherein the modified codon specifies an 
amino acid that is different from the amino acid specified by the naturally 

30 occurring codon. 
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243. The plant of claim 218, wherein the plant is selected from the 
group consisting of Arabidopsis and safflower. 

244. A transgenic plant comprising a nucleic acid construct a seed- 
specific promoter operably linked to a gene fusion, wherein the gene fusion 

5 comprises a region encoding an oil-body-protein or an active fragment thereof, 
operably linked to a region encoding a first thioredoxin-related protein or an 
active fragment thereof, wherein a fusion protein comprising activities of oleosin 
and the thioredoxin-related protein is produced in a seed of the plant. 

245. The transgenic plant of claim 244, wherein the plant is selected 
10 from the group consisting of Arabidopsis and safflower. 

246. The transgenic plant of claim 244 wherein the promoter is "a 

phaseolin promoter. 

247. The seed of the plant of claim 244. 

248. The seed of claim 247, comprising a thioredoxin-related protein in a 
15 concentration of at least about 0.5% of total cellular seed protein. 

249. An extract of the seed of claim 247, wherein the extract comprises 
an activity of a thioredoxin-related protein. 

250. An oil body from the seed of claim 247. 

251 . Oil produced from the seed of claim 247. 

20 252. A method of making a fusion protein comprising a thioredoxin- 

related activity, the method comprising the steps of: 

providing a transgenic plant comprising a nucleic acid construct 

comprising a seed-specific promoter operably linked to a gene 
fusion, wherein the gene fusion comprises a region encoding an 
25 oil-body-protein or an active fragment thereof, operably linked to a 

region encoding a first thioredoxin-related protein or an active 
fragment thereof, the gene fusion encoding a fusion protein 
comprising a thioredoxin-related activity; 
obtaining seeds from the plant; and 
30 recovering the fusion protein by isolating oil bodies from the seeds. 

253. The method of claim 252, further comprising the step of 
fractionating the oil bodies to achieve partial purification of the fusion protein. 
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254. Oil bodies in association with a fusion protein, obtained by the 

method of claim 252. 

255. The method of claim 252 further comprising a step of cleaving the 
oil-body-protein from the thioredoxin-related protein after fractionation of the oil 

5 bodies. 

256. The method of claim 255, wherein the cleaving step comprises use 
of a protease. 

257. The method of claim 255, wherein the cleaving step comprises 
chemical proteolysis. 

10 258. A method of reducing allergenicity of a food comprising the steps 

of: 

providing a preparation comprising oil bodies associated with a 
fusion protein, the fusion protein comprising an oil-body-protein or an 
active fragment thereof and a thioredoxin-related protein or an active 

15 fragment thereof; and 

adding the preparation to the food, whereby allergenicity of the 
food is reduced due to activity of the thioredoxin-related protein or 
fragment. 

259. The method of claim 258, wherein the food is selected from the 
20 group consisting of wheat flour, wheat dough, milk, cheese, yogurt and ice 

cream. 

260. The method of claim 258, further comprising providing NADH as a 
co-factor in the substantial absence of NADPH. 

261. A composition comprising a fusion protein, the fusion protein 

25 comprising an oil-body-protein or an active fragment thereof and a thioredoxin- 
related protein or an active fragment thereof, in a pharmaceutical^ acceptable 
carrier. 

262. The composition of claim 261, further comprising oil bodies in 
association with the fusion protein. 

30 263. A cosmetic formulation comprising' oil bodies associated with a 

fusion protein, the fusion protein comprising an oil-body-protein or an active 
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fragment thereof and a thioredoxin-related protein or an active fragment thereof, 
in a pharmaceutical^ acceptable carrier. 

264. A method of treating or protecting a target against oxidative stress, 

comprising the steps of: 
5 providing a preparation comprising a fusion protein, the fusion 

protein comprising an oil-body-protein or an active fragment thereof and a 
thioredoxin-related protein or an active fragment thereof; and 

contacting the preparation with a target, wherein the target is 
susceptible to oxidative stress, thereby treating or protecting against the 

10 stress. 

265. The method of claim 264, wherein the target is selected from the 
group consisting of a molecule, a molecular complex, a cell, a tissue, and an 
organ. 

266. A nucleic acid construct comprising a gene fusion, wherein the 
15 gene fusion comprises a first region encoding an oil-body-protein or an active 

fragment thereof, operably linked to a second region encoding at least one 
polypeptide or an active fragment thereof, and an oil-body-surface-avoiding linker 
in frame between the first and second region polypeptides. 
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HE TEROMTJLTIMERS 



Class 



Biosynthetic 



Biosyathctic 



Heteromultimer 



Example sequence reference tor 
heteromultimeric subunits 



"~3-methyl-2-oxobutanoate 
dehydrogenase (2-oxoisovalerate 
dehydrogenase (Hpoamide))- El 
component) 



McKean, et al Biochim. Biophys. Acta (1992) 
1171:109-112 / Cuuang,LL. s et al. FEES Lett, a 
(1990) 262 (2), 305-309. 



3-oxoadipate Co A-transf erase 



Paraies, R.E. and Harwood, S.C. J. Bactcriol. (1992) 
174:4657-4666 



Biosynthetic 



Biosynthetic 
Biosynthetic 
Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



beta-ketoacyl-[acyl carrier protein] 

synthase I 

butyrate— acetoacetate CoA-transferase 



Fischer,RJ., et al J. Bacteriol. (1993) 175.(21), 
6959-6969. 



cAMP dependent protein kinase 



Mutzel, R et al. Proc. Natl. Acad. Sci. U.SA. (1987) 
84:6-10./ Burki,E., et al Gene (1991) 102 (1), 57- 
65. 



carbamoyk-phoaphate synthase 



Shigenobu,S., et al. Nature. (2000) 407 (6800), 81- 
86. 



Creatine kinase 



garnma-glutamyltransferase (garama- 
glutamyl transpeptidase) 



Billadello, J.J.; et al. Biochem. Biophys. Res. 
Coramun. (1986) 138:392-398. / Roman, D.; et al 
Proc. Natl. Acad. Sci. U.S.A (1985). 82:8394-8398. 
Papandrikopoulou, A.; et al. Eur. J. Biochem 
(1989) 183:693-698. 



Morrow, C.S. et al Gene (1989) 75:3-1 1 



Cole, S.T. etal J. Bacteriol. (1988) 170:2448-2456. 



guanylatc cyclase 



Hinsch, K.D. etal FEBS Lett. (1988) 239:29-34/ 
Koesling, P. et al FEBS Lett. (1990) 266:128-132. 



heterodisulfide reductase 



Smith,D.R., et al X. Bacteriol. (1997) 179 (22), 
7135-7155. 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Hydiogenase 



Meprin A 



Johnson, G.D. and Hersh, L.B. J. Biol. Chem. 
(1992) 267:13505-13512. 



methionine adenosyltransf erase 



Horikawa, S.; Tsukada, K. FEBS Lett. (1992) 
312:37-41. 



Na+/K+-exchanging ATPase 



TvfAD(+)-depen(ient isocitrate 
dehydrogenase 



phosphoribosylformylglycinamidine 
synthase 



protocatechuate 3,4-dioxygenase 



Shull,G.E. f et al Biochemistry (1986) 25 (25), 
8125-8132./Mercer,R.W., etal. 
Mol. Cell. Biol. (1986) 6 (11), 3884-38907 . 
Mercer,R.W., etal J. Cell Biol. (1993) 121 (3), 
579-586. 



Cupp, J.R. and McAlister-Henn, L. J. Biol. Chem. 
(1992) 267:16417-16423. /Cupp, LR. and 
McAlister-Henn, L. 

J. Biol. Chem. (1991) 266:22199-22205. 

Ebbole, D.J.; Zalkin, H. J. Biol. Chem. (1987) 
262:8274-8287. 



Frazee, R.W.; etal. J. Bacteriol. (1993) 175:6 jy4- 
6202. 



Engelkamp, P.; etal. Biochemistry (1992) 
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Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosynthetic 



Biosythetic 



Calcium binding 



Calcium binding 



Calcium binding 



DNA binding 



DNA binding 



DNA bmding 
DNA binding 



31:10258-10264. / Allore, R.J.; et al J. Biol. Chem. 

(1990) 265:15537-15543. 

sucrose--fructan 6-fructosyltransferase I Sprenger, N.; et al Proc. Natl. Acad. Sci. U.S.A. 

(1995) 92:11652-11656. 



Superoxide dismutase 



Capo, C.R.; et al Biochem. Biophys. Res. 
Commun. (1990) 173:1186-1193. 
Labigne, A.; et al J. BacterioL (1991) 173:1920- 

1931. . 

uxokdnase-type plasminogen activator I Belin, D. et al Eur. J. Biochem. (1985) 148:225- 
(urokinase) I 232 



Urease 



methylmalonyl-coenzyme A mutase 



Birch,A., etal. J. BacterioL (1993) 175 (11), 3511- 
3519. 



Calcineurin 



Muramatsu.T. and Kincaid,R.L. Biochim. Biophys. 
Acta (1993) 1178 (1), 117-120 / Guerini, D. et al 
DNA (1989) 8:675-682. 



Calgranulin 



Imamichi, T. et al Biochem. Biophys. Res. 
Commun. (1993) 194:819-825.. 



Calpain 



Aoki, K. et al FEBS Lett. (1986) 205:313-317. 



API 



van Straaten.F., et al Proceedings of the National 
Academy of Sciences of the United States of 
America.' (1983) 80 (11), 3183-3187. /Hattori,K., et 
al Proceedings of the National Academy of Sciences 
of the United States of America. (1988) 85 (23), 
9148-9152. 



cMyc-Max 



Schreiber-Agus,N et al Moi. Cell. Biol. (1993) 13 
(5), 2765-2775. 



DNA binding protein HU-l/HU-2 
hepatic nuclear factor 1 



Laine, B. et al Eur. J. Biochem. (1980) 103:447- 

461. __. 

BachJ et al Nucleic Acids Res. (1992) 20 (16), 
4199-4204. / Rey-Campos,J. et al EMBO J. (1991) 
10 (6), 1445-1457 



DNA binding , 



Integration host factor 



Miller,H.I. Cold Spring Harbor symposia on 
quantitative biology. (1984) 49, 691-698. / Flamm, 
E. and Weisberg, ILA. J. Mol. Biol. (1985) 183:117- 
128. 



DNA binding 



Ku 



DNA binding 
DNA binding 



MutS 
NF-E2 



Reeves,W.H. and Sthoeger,Z.M. J. Biol. Chem. 
(1989) 264 (9), 5047-5052. / 
J. Biol. Chem. (1989) 264 (23), 13407-1341 1. 
Bocker et al 1999. Cancer Research 59, 816-822 
Chan,J.Y. et al. Proc. Natl. Acad. Sci. U.S.A. 
(1993) 90 (23), 11366-113707 Toki,T., etal 
Oncogene (1997) 14 (16), 1901-1910. 



DNA binding 



nuclear factor kB (NFkB) 



Electron transport 
Electron transport 



corrinoid/iron-suifur protein 
cytochrome d ubiquinol oxidase 



Kieran M, et al Cell. (1990) Sep 7;62(5):1007-18. 
Ruben SM, et al Science (1991) Mar 
22;251(5000):1490-3. 

Erratum in: Science (1991) Oct 4;254(5028):1 1 
Lu, W.P. et al J. Biol. Chem. (1993) 268:5605- 
5614. 



/ 



Green, 51 et al J. Biol. Chem. (1988) 263:13138- 
13143. 



Electron transport 



Electron transport 



cytochrome-c3 hydrogenase 



Menon, N.K. et al J. BacterioL (1987) 169:5401- 
5407. ■ 



electron transfer flavoprotein 



Finocchiaro, G. et al Biol. Chem. (1988) 
263:15773-15780. / Finocchiaro, G. etal Eur. J. 
Biochem. (1993) 213:1003-1008. 
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Electron transport 


xylene monooxygenase 


Shaw, J.P. and Harayama, S. Eur. J. Biochem. 
(1992) 209:51-61. / Kasai,Y., et al J. Bacteriol. 
(2001 183 (22\ 6662-6666 


Growth factor 


hepatocyte growth factor 


Nakamura, T. et al Nature (1989) 342:440-443. . 


Growth factor 


human chorionic gonadotropin 


Morgan,F.J. et al J. Biol. Chem, (1975) 250 (13), 
5247-5258. 


Growth factor 


Platelet-derived growth factor 


Takimoto,Y., et al Hiroshima J. Med. Sci. (1993) 
4z {i)t 4/oz./ JosepnSjO.r ei at. ocience ^li/o^j 
225 (4662), 636-639. 


Hormone 


Bombyxin 


Adachi, T. et al J. Biol. Chem. (1989) 264:7681- , 
7685. 


Hormone 


Follicle stunulatmg hormone 


Fiddes,J.C. and uooaman,JH..M. J. Mol. Appl. 
Genet. (1981) 1 (1), 3-18. / Watkins,P.C, etal 
DNA (1987) 6 (3), 205-212. 


Hormone 

■ 


Insulm 


Bell,G.L, Pictet,R.L., Kutter,W.J., Cordell,**., 
Tischer,E. and Goodman,H.M. 
Sequence of the human insulin gene. Nature. 284 
(5751), 26-32 (1980) 


Hormone 


Luteinizing Hormone 


riddes,J.C. and Ooooxnan,ti.M. J. Jvlol. Appl. 
Oenet. (lyoi) 1 (1), J-lo. / onome,r>. ana 
Parlow,A.F. J. Clin. Endocrinol. Metab; (1973) 36 
(3), olo-oZl. 


Hormone 


Thyroid stimulating hormone 


Fiddes,J.C. and Goodman,H.M. J. Mol. Appl. 
Genet. (1981) 1 (1), 3-18. /Hay ashizatri Y, et al 

TTRRQ T *»++ f1QR*i\ 1 RR CY\ A(\H 
rCDo -Lett, ^lyo-jj loo jy*k-H\J\J. 


Immune 


B-cell antigen receptor complex 


Hashimoto,S. etal J. Immunol. (1993) 150 (2), 491- 
498. / Flaswinkel 3 H. and Reth,M. 
lmmuno generics ->o zoo-zuy. 


Immune 


Cell surface CJJo molecules 


ureta- v idai,A., fl/. immunogenencs ^lyyyj 

H o\ 715? TJ 1 


Immune 


human complement subcomponent Clq 


Sellar, G.C. era/. Biochem. J. (1991) 274:481-490. 


Immune 


T cell receptor 


l aiicen,jD.j^. et at. ocano. j. immunoi. ^uui j j** ^i- 
2), 204-210. 


Photosynthesis ~ 


C-phycocyanin 

■ 


Ofmer, G.D, a/. J. Biol. Chem. (1981) 256:12167- 
lzi to. 1 iroxier, iv.! 4 . ei at. j. ojui. uncin. ^iiroi^ 
256:12176-12184. 


Photosynthesis 


ferroredoxin-thioredoxin reductase 


Chow, L.P. ef a/. Eur. J. Biochem. (1995) 231:149- 
156. / Iwadate, H. et al Eur. J. Biochem. (1994) 

Z.Z. D .tOJ-*T / 1 . 


Photosynthesis 


Light harvesting complex I 


Proc. Natl. Acad. Sci. U.S.A. (1984) 81, 189-192. 


Photosynthetic 


cytochrome b559 


Carrillo, N. etal Curr Genet. 1986;10(8):619-24. 


Protease 


A 1 Jr -dependent v-/ip protease 


aaTtU TT at nl Client* /"IQO^ 1 R1 «77 R^ /lflinQtF 

vjerui, vj. et at. uene ^ii/i/o^ 101./ /-oj. f jvuiioi,j'. 
era/ Nature (1997) 390 (6657\ 249-256. 


Receptor 


-alpha-2-macroglobulin receptor 


Strickland, D.K. et al J. Biol. Chem. (1990). 
265:1740 L-17404. / Strickland, D.K. et al J. Biol. 
Chem. (1991) 266:13364-13369. 


Receptor 


Interleukin-2 receptor . 

t 


Ishida, N. et al Nucleic Acids Res. (1985) 13:7579- 
7589./ Hatakeyama, M. etal Science (1989) 
244:551-556 / Takeshita, T. et al Science (1992) 
257:379-382. 


Receptor 


platelet-derived growth factor receptor 


Lee, K.H. et al Mol. Cell. Biol. (1990) 10:2237- 
2246. / Herren,B. et al Biochim. Biophys. Acta 
1173 (3), 294-302(1993). 


Structural 


Hemoglobin 


HeindelULC. et al CeU (1978) 15 (1), 43-54. / 
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■ 


BestJ.S. et al. Hoppe-Seyler's Z. Physiol. Chem. 
(1989) 350 (5), 563-580. /Hardison,RX. J. Biol. 
Chem, (1981) 256 (22), 11780-11786. 


Structural 


human platelet glycoprotein lb 


Wenger,R.H. et al. Biochem. Biophys. Res. 
Commun. (1988) 156 (1), 389-395. / Yagi,M. etal 
J. Biol. Chem. (1994) 269 (26), 17424-17427. 


Structural 


Plasma fibronectin 


Kornblihtt, A.R. et al. Proc. Natl. Acad. Sci. U.S A. 
(1983) 80:3218-3222. 


Structural 


Spectrin 


Sahr, K.E. et al J. Biol. Chem. (1990) 265:4434- 
4443. / Winkelmann, J.C. et al. J. BioL Chem. 
(1990)265:11827-11832. 


Structural 


Tubulin 


Ponstmgl, H. a/. Proc. Natl. Acad. Sci. U.S.A. 
(1981) 78:2757-2761. / Krauhs, E. er a/. Proc. Natl. 
Acad. Sci. U.S.A. (1981) 78:4156-4160. 


Toxin 


Agkisacutacin 


Cheng,X. e/ a/. Biochem. Biophys. Res. Commun. 
(1999) 265 (2), 530-535. 


Toxin 


Beta bungaro toxins 


Kondo, K. fl/. J. Biochem. (1978) 83:101-1 15. 


Toxin 


Crotoxin 


Bouchier,C. et al Nucleic Acids Res. (1988) 16 
(18), 9050. 


Toxin 


Mojave toxin 


John, T.R. et al Gene (1994) 139:229-234. 


Toxin 


venom protein C9S3 


Rowan, E.G. et al. Nucleic Acids Res. (1990) 
1 8: 1 639. / Joubert, F.J. and Viljoen, C.C. Hoppe- 
Seyler's Z. Physiol Chem. (1979) 360:1075-1090. 


Miscellaneous 


Inhibin 


Forage, R.G. et al. Proc. Natl. Acad. Sci. U.S.A. 
(1986) 83:3091-3095. 


Miscellaneous ! 


Monellin 


Frank, G. and Zuber, H. Hoppe-Seyler's Z. Physiol. 
Chem. (1976) 357:585-592. 


Miscellaneous 


mRNA capping enzyme 


Niles, E.G. etal., J. Virology (1986) 153:96-112. 


Miscellaneous 


Soybean insulin-binding protein si30 


Barbashov, S.F. et al Bioorg, Khim. (1991) 17:421- 
423. 
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SEQUENCE LISTING 

<110> SemBioSys Genetics, Inc. 

Syngenta Participations AG 

<12 0> METHODS FOR THE PRODUCTION OF MULTIMERIC PROTEINS, AND RELATED 
COMPOSITIONS 

<130> 38814-351PC 

<140> Not Yet Assigned 
<141> Herewith 

<160> 313 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 1 

taccatggct tcggaagaag ga 22 

<210> 2 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Primer 
<400> 2 

gaaagcttaa gccaagtgtt tg 22 

<210> 3 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Primer 
<400> 3 

ggccagcaca ctaccatgaa tggtctcgaa actcac 3 6 

<210> 4 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 4 

ttaagcttca atcactctta ccttgctg 2E 

<210> 5 

<211> 72 

<212> DNA 

<213> Artificial Sequence 
<220> 



-1- 
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<223> Primer 



<400> 5 

actggagatg ttgactcgac ggatactacg 
gtgatcgcct gc 

<210> 6 
<211> 80 
<212> DNA 

<213> Artificial Sequence 



gattggtcga cggctatgga agaaggacaa 6 0 



<220> 

<223> Primer 



atccgtcgag tcaacatctc cagtttcctc ggtggtctcg ttagccttcg atccagcaat 60 
ctcttgtaag aatgctctgc 

<210> 7 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> Primer 



<400> 7 

gtggaagctt atggagatgg ag 

<210> 8 
<211> 1002 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Chimeric 



<400> 8 



atgaatggtc tcgaaactca caacacaagg ctctgtatcg taggaagtgg cccagcggca 60 
caLcgglgg cgatttacgc agctagggct gaacttaaac ctcttctctt ^gaaggatgg 120 
atggctaacg acatcgctcc cggtggtcaa ctaacaacca ccaccgacgt cgagaatttc 180 
cccgga^ttc cagaafgtat tctcggagta gagctcactg acaaattccg taaacaatcg 240 
gagcgattcg gtactacgat atttacagag acggtgacga aagtcgattt ctcttcgaaa 3 00 
ccgtftaagc tattcacaga ttcaaaagcc attctcgctg acgctgtgat ^tcgctact 360 
ggagctgtgg ctaagcggct tagcttcgtt ggatctggtg aaggttctgg aggtttctgg 420 
ILcgtlgIS tctccgcttg tgctgtttgc gacggagctg ctccgatatt ccgtaacaaa 480 
cctcttgcqg tgatcggtgg aggcgattca gcaatggaag aagcaaactt tcttacaaaa 540 
tatgga?cta aagtgtatat aatccatagg agagatgctt ttagagcgtc taagattatg 600 
cagcagcgag ctttgtctaa tcctaagatt gatgtgattt ggaactcgtc tgttgtggaa 660 
gcttatggal atgglgaaag agatgtgctt ggaggattga aagtgaagaa tgtggttacc 720 
Igagatlttt ctptttaai agtttctgga ttgttctttg ctattggtca tgagccagct 780 
Iccaagtttt tggatggtgg tgttgagtta gattcggatg gttatgttgt cacgaagcct 840 
ggtactacac apctigcp tccclglgtt ttcgctgcgg gtgatgttca OT^taagaag 900 
tataggcaag ccatcactgc tgcaggaact gggtgcatgg cagctttgga tgcagagcat 360 
tacttacaag agattggatc tcagcaaggt aagagtgatt ga 

<210> 9 
<211> 999 
<212> DNA 

<213> Arabidopsis thaliana 

ataaatqqtc tcgaaactca caacacaagg ctctgtatcg taggaagtgg cccagcggca 
cacacglS cgatttacgc agctaggglt gaacttaaac ctcttctctt cgaaggatgg 
ataactaaca acatcgctcc cggtggtcaa ctcaaccaac caccgcgtga gaatttcccc 180 
qgatttccal aaggtlttct cggaltagag ctcactgaca aattccgtaa acaatcggag 240 
c|attcggt! ctlcgatatt tlcalaglcg gtgacgaaag tcgatttctc ttcgaaaccg 3 00 



60 

120 

180 
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tttaagctat tcacagattc aaaagccatt ctcgctgacg ctgtgattct cgctatcgga 360 

gctgtggcta agtggcttag cttcgttgga tctggtgaag ttctcggagg tttgtggaac 420 

cgtggaatct ccgcttgtgc tgtttgcgac ggagctgctc cgatattccg caacaaacct 480 

cttgcggtga tcggtggagg cgattctgca atggaagaag caaactttct tacaaaatat 540 

qgatctaaag tgtatataat cgataggaga gatgctttta gagcgtctaa gattatgcag 600 

cagcgagctt tgtctaatcc taagattgat gtgatttgga actcgtctgt tgtggaagct 660 

tatqgagatg gagaaagaga tgtgcttgga ggattgaaag tgaagaatgt ggttaccgga 720 

gatgtttctg atttaaaagt ttctggattg ttctttgcta ttggtcatga gccagctacc 780 

aagtttttgg atggtggtgt tgagttagat tcggatggtt atgttgtcac gaagcctggt 840 

actacacaga ctagcgttcc cggagttttc gctgcgggtg atgttcagga taagaagtat 900 

aggcaagcca tcactgctgc aggaactggg tgcatggcag ctttggatgc agagcattac 960 

ttacaagaga ttggatctca gcaaggtaag agtgattga 999 

<210> 10 
<211> 1002 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Chimeric 

<221> CDS 

<222> (1) . . . (1002) 

<2 23> cDNA encoding NADPH thioredoxin reductase 



48 



96 



<400> 10 

atg aat ggt etc gaa act cac aac aca agg etc tgt ate gta gga agt 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys He Val Gly Ser 
15 10 15 

qqc cca gcg gca cac acg gcg gcg att tac gca get agg get gaa ctt 
Gly Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu 

20 25 30 

aaa cct ctt etc ttc gaa gga tgg atg get aac gac ate get ccc ggt 144 
Lys Pro Leu Leu Phe Glu Gly Tip Met Ala Asn Asp He Ala Pro Gly 
35 40 45 

ggt caa eta aca ace ace ace gac gtc gag aat ttc ccc gga ttt cca 
Gly Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro 
50 55 60 



192 



240 



288 



gaa ggt att etc gga gta gag etc act gac aaa ttc cgt aaa caa teg 
Glu Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser 
65 70 75 80 

gag cga ttc ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat 
Glu Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp 

85 90 95 

ttc tct teg aaa ccg ttt aag eta ttc aca gat tea aaa gee att etc 336 
Phe Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu 

100 105 HO 

get gac get gtg att etc get act gga get gtg get aag egg ctt age 3 84 
Ala Asp Ala Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser 
115 120 125 

ttc gtt gga tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate 432 
Phe Val Gly Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He 
130 135 140 

tec get tgt get gtt tgc gac gga get get ccg ata ttc cgt aac aaa 480 
Ser Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys 
145 150 155 160 

cct ctt gcg gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac 52 8 
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Pro Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn 

165 170 175 

ttt ctt aca aaa tat gga tct aaa gtg tat ata ate cat agg aga gat 576 
Phe Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp 

180 185 190 

get ttt aga gcg tct aag att atg cag cag cga get ttg tct aat cct 
Ala Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro 
195 " 200 205 

aag att gat gtg att tgg aac teg tct gtt gtg gaa get tat gga gat 
Lys He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp 
210 215 220 

gga gaa aga gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc 
Gly Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr 
225 230 235 240 

qqa gat gtt tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt 
Gly Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala lie Gly 
* 245 250 255 

cat gag cca get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg 
His Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser 

260 265 270 

gat ggt tat gtt gtc acg aag cct ggt act aca cag act age gtt ccc 
Asp Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro 
275 280 285 

gga gtt ttc get gcg ggt gat gtt cag gat aag aag tat agg caa gee 
Glv Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala 
290 295 300 

ate act get gca gga act ggg tgc atg gca get ttg gat gca gag cat 
He Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His 
305 310 315 320 

tac tta caa gag att gga tct cag caa ggt aag agt gat tga 
Tvr Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp * 

325 330 



<210> 11 
<211> 333 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 

Met^Asn^Gly Leu Glu Thr His Asn Thr Arg Leu Cys He Val Gly Ser 

15 10 I 5 

Glv Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu 

20 25 30 

Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly 

35 40 45 

Glv Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro 

50 55 60 

Glu Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser 
65 70 75 80 

Glu Arq Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp 

85 90 95 

Phe Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu 



624 



672 



720 



768 



816 



864 



912 



960 



1002 
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100 

Ala Asp Ala Val 
115 

Phe Val Gly Ser 
130 

Ser Ala Cys Ala 
145 

Pro Leu Ala Val 

Phe Leu Thr Lys 

180 

Ala Phe Arg Ala 
195 

Lys lie Asp Val 
210 

Gly Glu Arg Asp 
225 

Gly Asp Val Ser 

His Glu Pro Ala 

260 

Asp Gly Tyr Val 
275 

Gly Val Phe Ala 
290 

lie Thr Ala Ala 
305 

Tyr Leu Gin Glu 



He Leu Ala Thr 

120 

Gly Glu Gly Ser 
135 

Val Cys Asp Gly 
150 

He Gly Gly Gly 
165 

Tyr Gly Ser Lys 

Ser Lys He Met 

200 

He Trp Asn Ser 
215 

Val Leu Gly Gly 
230 

Asp Leu Lys Val 
245 

Thr Lys Phe Leu 

Val Thr Lys Pro 

280 

Ala Gly Asp Val 
295 

Gly Thr Gly Cys 
310 

He Gly Ser Gin 
325 



105 

Gly Ala Val Ala 

Gly Gly Phe Trp 

140 

Ala Ala Pro lie 
155 

Asp Ser Ala Met 
170 

Val Tyr He He 
185 

Gin Gin Arg Ala 

Ser Val Val Glu 

220 

Leu Lys Val Lys 
235 

Ser Gly Leu Phe 
250 

Asp Gly Gly Val 
265 

Gly Thr Thr Gin 

Gin Asp Lys Lys 

300 

Met Ala Ala Leu 
315 

Gin Gly Lys Ser 
330 



110 

Lys Arg Leu Ser 
125 

Asn Arg Gly He 

Phe Arg Asn Lys 

160 

Glu Glu Ala Asn 
175 

His Arg Arg Asp 
190 

Leu Ser Asn Pro 
205 

Ala Tyr Gly Asp 

Asn Val Val Thr 

240 

Phe Ala He Gly 
255 

Glu Leu Asp Ser 
270 

Thr Ser Val Pro 
285 

Tyr Arg Gin Ala 

Asp Ala Glu His 

320 

Asp 



<210> 12 
<211> 332 
<212> PRT 

<213> Arabidopsis thai i ana 



<400> 12 
Met Asn Gly Leu 
1 

Gly Pro Ala Ala 

20 

Lys Pro Leu Leu 
35 

Gly Gin Leu Asn 
50 

Gly He Leu Gly 
65 

Arg Phe Gly Thr 

Ser Ser Lys Pro 

100 

Asp Ala Val He 
115 

Val Gly Ser Gly 
13 0 

Ala Cys Ala Val 
145 

Leu Ala Val He 

Leu Thr Lys Tyr 

180 

Phe Arg Ala Ser 
195 

He Asp Val He 
210 

Glu Arg Asp Val 
225 



Glu Thr His Asn 
5 

His Thr Ala Ala 

Phe Glu Gly Trp 

40 

Gin Pro Pro Arg 
55 

Val Glu Leu Thr 
70 

Thr He Phe Thr 
85 

Phe Lys Leu Phe 

Leu Ala He Gly 

120 

Glu Val Leu Gly 
135 

Cys Asp Gly Ala 
150 

Gly Gly Gly Asp 
165 

Gly Ser Lys Val 

Lys He Met Gin 

2 0 0 

Trp Asn Ser Ser 
215 

Leu Gly Gly Leu 
230 



Thr Arg Leu Cys 
10 

He Tyr Ala Ala 
25 

Met Ala Asn Asp 

Glu Asn Phe Pro 

60 

Asp Lys Phe Arg 
75 

Glu Thr Val Thr 
90 

Thr Asp Ser Lys 
105 

Ala Val Ala Lys 

Gly Leu Trp Asn 

140 

Ala Pro He Phe 
155 

Ser Ala Met Glu 
170 

Tyr He He Asp 
185 

Gin Arg Ala Leu 

Val Val Glu Ala 

220 

Lys Val Lys Asn 
235 



He Val Gly Ser 
15 

Arg Ala Glu Leu 
30 

He Ala Pro Gly 
45 

Gly Phe Pro Glu 

Lys Gin Ser Glu 

80 

Lys Val Asp Phe 

95 

Ala He Leu Ala 
110 

Trp Leu Ser Phe 
125 

Arg Gly He Ser 

Arg Asn Lys Pro 

160 

Glu Ala Asn Phe 
175 

Arg Arg Asp Ala 
190 

Ser Asn Pro Lys 
205 

Tyr Gly Asp Gly 

Val Val Thr Gly 

240 
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Asp Val Ser Asp 

Glu Pro Ala Thr 

260 

Gly Tyr Val Val 
275 

Va3, Phe Ala Ala 
290 

Thr Ala Ala Gly 
305 

Leu Gin Glu lie 



Leu Lys Val Ser 
245 

Lys Phe Leu Asp 

Thr Lys Pro Gly 

280 

Gly Asp Val Gin 
2 95 

Thr Gly Cys Met 
310 

Gly Ser Gin Gin 
325 



Gly Leu Phe Phe 
250 

Gly Gly Val Glu 
265 

Thr Thr Gin Thr 

Asp Lys Lys Tyr 

300 

Ala Ala Leu Asp 
315 

Gly Lys Ser Asp 
330 



Ala He Gly His 
255 

Leu Asp Ser Asp 
270 

Ser Val Pro Gly 
285 

Arg Gin Ala He 

Ala Glu His Tyr 

320 



<210> 13 
<211> 333 
<212> PRT 
<213> Artificial 



Sequence 



<220> 

<223> Chimeric 



<400> 13 
Met. Asn Gly Leu 
1 

Gly Pro Ala Ala 

20 

Lys Pro Leu Leu 
35 

Gly Gin Leu Thr 
50 

Glu Gly He Leu 
65 

Glu Arg Phe Gly 

Phe Ser Ser Lys 

100 

Ala Asp Ala Val 
115 

Phe Val Gly Ser 
13 0 

Ser Ala Cys Ala 
145 

Pro Leu Ala Val 

Phe Leu Thr Lys 

180 

Ala Phe Arg Ala 
195 

Lys He Asp Val 
210 

Gly Glu Arg Asp 
225 

Gly Asp Val Ser 

His Glu Pro Ala 

260 

Asp Gly Tyr Val 
275 

Gly Val Phe Ala 
290 

He Thr Ala Ala 
305 

Tyr Leu Gin Glu 



Glu Thr His Asn 
5 

His Thr Ala Ala 

Phe Glu Gly Trp 

40 

Thr Thr Thr Asp 
55 

Gly Val Glu Leu 
70 

Thr Thr He Phe 
85 

Pro Phe Lys Leu 

He Leu Ala Thr 

120 

Gly Glu Gly Ser 
135 

Val Cys Asp Gly 
150 

He Gly Gly Gly 
165 

Tyr Gly Ser Lys 

Ser Lys He Met 

200 

He Trp Asn Ser 
215 

Val Leu Gly Gly 
230 

Asp Leu Lys Val 
245 

Thr Lys Phe Leu 

Val Thr Lys Pro 

280 

Ala Gly Asp Val 
295 

Gly Thr Gly Cys 
310 

He Gly Ser Gin 
325 



Thr Arg Leu Cys 
10 

He Tyr Ala Ala 
25 

Met Ala Asn Asp 

Val Glu Asn Phe 

60 

Thr Asp Lys Phe 

75 

Thr Glu Thr Val 
90 

Phe Thr Asp Ser 
105 

Gly Ala Val Ala 

Gly Gly Phe Trp 

140 

Ala Ala Pro He 
155 

Asp Ser Ala Met 
170 

Val Tyr He He 
185 

Gin Gin Arg Ala 

Ser Val Val Glu 

220 

Leu Lys Val Lys 
235 

Ser Gly Leu Phe 
250 

Asp Gly Gly Val 
265 

Gly Thr Thr Gin 

Gin Asp Lys Lys 

300 

Met Ala Ala Leu 
315 

Gin Gly Lys Ser 
330 



He 


Val 


Gly 


Ser 






15 




Arg 


Ala 


Glu 


Leu 




30 






He 


Ala 


Pro 


Gly 


45 








Pro 


Gly 


Phe 


Pro 


Arg 


Lys 


Gin 


Ser 








80 


Thr 


Lys 


Val 


Asp 






95 




Lys 


Ala 


He 


Leu 


110 






Lys 


Arg 


Leu 


Ser 


125 








Asn Arg 


Gly 


He 


Phe 


Arg 


Asn 


Lys 








160 


Glu 


Glu 


Ala 


Asn 






175 




His 


Arg 


Arg Asp 




190 






Leu 


Ser 


Asn 


Pro 


205 








Ala 


Tyr 


Gly 


Asp 


Asn 


Val 


Val 


Thr 








240 


Phe 


Ala 


He 


Gly 






255 




Glu 


Leu 


Asp 


Ser 




270 






Thr 


Ser 


Val 


Pro 


285 








Tyr 


Arg 


Gin 


Ala 


Asp 


Ala 


Glu 


His 








320 


Asp 
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<210> 14 
<211> 3129 
<212> DNA 

<213> Artificial Sequence 



<220> 
<221> 
<222> 



CDS 
(1555) 



(1899) 



<223> Chimeric 



<400> 14 

ctgcaggaat 

tggtttttta 

ttacttgtta 

tatccctaca 

ccgtctatct 

agcgttggta 

caatataaac 

ttatcaaata 

aatttaactc 

agccattttt 

aaatttcacc 

ataattccca 

cttctaaagt 

tattacttct 

aattgttgaa 

agtttaaagt 

tataagattt 

atgtaagtcc 

atgcatggat 

acaaaacgca 

catgtatgtc 

cccatctcaa 

tatatattca 

atgcgtgtca 

atacctataa 

actactctac 



tcattgtact 

cctctattta 

ctttaatttc 

aatttattat 

ttaatgtagt 

gaaagcataa 

aaattcttta 

tttttcaacc 

cataattttt 

ttctttgaat 

aaacaatcat 

tttgacacta 

aattttaata 

aaaaaattaa 

tttgtgacta 

aaatataagt 

atggtggact 

gtaactggaa 

gcttgcgcaa 

atcacacaac 

taaatgccat 

cccacacaca 

ttctcttccg 

tcccatgccc 

atacctctaa 

tactataata 



cccagtatca 

aaggggtttt 

tcataatctt 

ttgttaaaca 

ctaacatttt 

tgatttattc 

ccttaagaag 

acgtaaatct 

ttattcgact 

aaaaaaatcc 

ttgtggtatt 

cggaagtaac 

atagttacta 

ttagatataa 

ttgatttatt 

aatgtagtag 

aattttcata 

ttactgtggg 

gaaaaagaca 

caactcaaat 

gcaaagcaac 

aacacattgc 

ccacctcaat 

aaatctccat 

tatcactcac 

ccccaaccca 



ttatagtgaa 

ccacctaaaa 

tggttgaaat 

ttttcaaacc 

catattgaaa 

ttattcttct 

gatttcccat 

cataataata 

gatcttaaag 

aattatcatt 

tctgaagcaa 

tgaagatctg 

tattcaagat 

ttaaaatatt 

attctactat 

agtgttagag 

tatttcttat 

ttgccatggc 

aagaacaaag 

tagtcactgg 

acgtgcttaa 

ctttttcttc 

ttcttcactt 

gcatgttcca 

ttctttcatc 

actcatattc 



agttttggct 

attctggtat 

tatcacgctt 

gcataaaatt 

tatataattt 

tcatataaat 

tttatatttt 

agttgtttca 

caacacccag 

gtattttttt 

gtcatgttat 

cttttacatg 

ttcatatatc 

acttttttaa 

gtttaaattg 

tgttacccta 

tgcttttacc 

actctgtggt 

aaaaaagaca 

ctgatcaaga 

catgcacttt 

atcatcacca 

caacacacgt 

accaccttct 

atccatccat 

aatactactc 



ctctcgccgg 
cattctcact 
ccgcacacga 
ttatgaagtc 
acttaatttt 
gtttaatata 
aaaaatatat 
aaagt aat aa 
tgacacaact 
tatacaatga 
gcaaaattct 
cgagacacat 
aaatactcaa 
ttttaagttt 
ttttatagat 
aaccataaac 
ttttcttggt 
cttttggttc 
aaacagagag 
tcgccgcgtc 
aaatggctca 
caaccacctg 
caacctgcat 
ctcttatata 
ccagagtact 
tact atg 
Met 
1 



get teg gaa gaa gga caa gtg ate gec tgc cac acc gtt gag aca tgg 
Ala Ser Glu Glu Gly Gin Val He Ala Cys His Thr Val Glu Thr Trp 

5 10 15 

aac gag cag ctt cag aag get aat gaa tec aaa act ctt gtg gtg gtt 
Asn Glu Gin- Leu Gin Lys Ala Asn Glu Ser Lys Thr Leu Val Val Val 
20 25 30 

gat ttc acg get tct tgg tgt gga cca tgt cgt ttc ate get cca ttc 
Asp Phe Thr Ala Ser Trp Cys Gly Pro Cys Arg Phe He Ala Pro Pne 
35 40 45 

ttt get gat ttg get aag aaa ctt cct aac gtg ctt ttc etc aag gtt 
Phe Ala Asp Leu Ala Lys Lys Leu Pro Asn Val Leu Phe Leu Lys Val 
50 55 60 65 

gat act gat gaa ttg aag teg gtg gca agt gat tgg gcg ata cag gcg 
Asp Thr Asp Glu Leu Lys Ser Val Ala Ser Asp Trp Ala He Gin Ala 

70 75 80 

atg cca acc ttc atg ttt ttg aag gaa ggg aag att ttg gac aaa gtt 
Met Pro Thr Phe Met Phe Leu Lys Glu Gly Lys He Leu Asp Lys Val 

85 90 95 

gtt gga gec aag aaa gat gag ctt cag tct acc att gec aaa cac ttg 
Val Gly Ala Lys Lys Asp Glu Leu Gin Ser Thr He Ala Lys His Leu 



60 

120 

180 

240 

300 

360 

420 

4B0 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1557 



1605 



1653 



1701 



1749 



1797 



1845 



1893 
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100 



105 



110 



get taa gcttaataag tatgaactaa aatgcatgta ggtgtaagag ctcatggaga 
Ala * 



1949 



gcatggaata 
ctatgaataa 
tatgataaat 
tcaaatagta 
ttgtgaacga 
catttatata 
aattataaag 
tacttatcca 
gttgatatgt 
atccttaaag 
ttggtttgat 
tatgtatata 
atctatacaa 
atttgacttt 
tateggcaag 
accaacttcc 
taatttgggt 
ttagcagtag 
caaagaataa 
ccaaaaacaa 



ttgtatccga 
acaaaggatg 
ttcctcttat 
caaaaacaaa 
gacataagtg 
ttatatatta 
agagaagttt 
cttatttaat 
atatgaaagg 
tgggtctatt 
aaaatattga 
aatttattat 
tegtttagee 
ttggttattt 
gaaataaaat 
acaggaaggt 
tgtcttgttt 
agcaatggtt 
ataaaataaa 
gtttcctagc 



ccatgtaaca 
ttatgatata 
tattataaat 
tgtgtactat 
ttaagaagac 
cccacttatg 
gtatccattt 
gtctttataa 
gtactatttg 
taattttatt 
aggatttaaa 
aatataacat 
ttgctggacg 
aacaaattat 
taaattagga 
caggtegggg 
getgeataat 
gaccgtgtgc 
atgagacact 
accctaccaa 



gtataataac 
ttaacactct 
catctgaatc 
aagactttct 
ataacaatta 
tattatatta 
atatattata 
ggtttgatcc 
aactctctta 
gcttcttaca 
ataataataa 
ttatctataa 
actctcaatt 
tatttaacac 
gggacaatgg 
acaacaaaaa 
ttatgcagta 
ttagcttctt 
tcagggatgt 
ctaaggtacc 



tgagctccat 
atetatgeae 
gtgaeggett 
aaacaattct 
taatggaaga 
ggatgttaag 
tactacccat 
atgatatttc 
ctctgtataa 
gataaaaaaa 
ataataaata 
aaaagtaaat 
atttaaacga 
tatatgaaat 
tgtgtcccaa 
aacaggcaag 
aaacactaca 
ttattttatt 
ttcaaccctt 



ctcacttctt 2009 
cttattgttc 2069 
atggaatgct 212 9 
aactttagca 2189 
agtttgtctc 2249 
gagacataac 2309 
ttatatatta 2369 
taatatttta 2429 
aggttggatc 24 8 9 
aaattatgag 2 549 
acatataata 2 609 
attgtcataa 2669 
gagtaaacat 2729 
tttttttttt 2789 
tccttataca 2849 
ggaaattttt 2909 
cataaccctt 2969 
tttttatcag 3029 
atacaaaacc 3 0 89 

3129 



<210> 
<211> 
<212> 
<213> 



15 

114 

PRT 

Artificial 



Sequence 



<220> 

<223> Chimeric 



<400> 15 
Met Ala Ser Glu 
1 

Trp Asn Glu Gin 

20 

Val Asp Phe Thr 
35 

Phe Phe Ala Asp 
50 

Val Asp Thr Asp 
65 

Ala Met Pro Thr 

Val Val Gly Ala 

100 

Leu Ala 



Glu 


Gly 


Gin 


Val 


5 








Leu 


Gin 


Lys 


Ala 


Ala 


Ser 


Trp 


Cys 








40 


Leu 


Ala 


Lys 


Lys 






55 




Glu 


Leu 


Lys 


Ser 




70 






Phe 


Met 


Phe 


Leu 


85 








Lys 


Lys 


Asp 


Glu 



lie Ala Cys His 
10 

Asn Glu Ser Lys 
25 

Gly Pro Cys Arg 

Leu Pro Asn Val 

60 

Val Ala Ser Asp 
75 

Lyg Glu Gly Lys 
90 

Leu Gin Ser Thr 
105 



Thr 


Val 


Glu 


Thr 






15 




Thr 


Leu 


Val 


Val 




30 






Phe 


He 


Ala 


Pro 


45 








Leu 


Phe 


Leu 


Lys 


Trp 


Ala 


He 


Gin 








80 


He 


Leu Asp 


Lys 






95 




He 


Ala 


Lys 


His 




110 







<210> 16 
<211> 3888 
<212> DNA 

<213> Artifcial sequence 
<220> 

<223> Chimeric 
<221> CDS 

<222> (1555) . . . (1907) 
<=221> CDS 

<222> (2148) . . . (2659) 
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<400> 16 

ctgcaggaat 

tggtttttta 

ttacttgtta 

tatccctaca 

ccgtctatct 

agcgttggta 

caatataaac 

ttatcaaata 

aatttaactc 

agccattttt 

aaatttcacc 

ataattccca 

cttctaaagt 

tattacttct 

aattgttgaa 

agtttaaagt 

tataagattt 

atgtaagtcc 

atgcatggat 

acaaaacgca 

catgtatgtc 

cccatctcaa 

tatatattca 

atgcgtgtca 

atacctataa 

actactctac 



tcattgtact 
cctctattta 
ctttaatttc 
aatttattat 
ttaatgtagt 
gaaagcataa 
aaattcttta 
tttttcaacc 
cataattttt 
ttctttgaat 
aaacaatcat 
tttgacacta 
aattttaata 
aaaaaattaa 
tttgtgacta 
aaatataagt 
atggtggact 
gtaactggaa 
gcttgcgcaa 
atcacacaac 
taaatgccat 
cccacacaca 
ttctcttccg 
tcccatgccc 
atacctctaa 
tactataata 



cccagtatca 
aaggggtttt 
tcataatctt 
ttgttaaaca 
ctaacatttt 
tgatttattc 
ccttaagaag 
acgtaaatct 
ttattcgact 
aaaaaaatcc 
ttgtggtatt 
cggaagtaac 
atagttacta 
ttagatataa 
ttgatttatt 
aatgtagtag 
aattttcata 
ttactgtggg 
gaaaaagaca 
caactcaaat 
gcaaagcaac 
aacacattgc 
ccacctcaat 
aaatctccat 
tatcactcac 
ccccaaccca 



ttatagtgaa 
ccacctaaaa 
tggttgaaat 
ttttcaaacc 
catattgaaa 
ttattcttct 
gatttcccat 
cataataata 
gatcttaaag 
aattatcatt 
tctgaagcaa 
tgaagatctg 
tattcaagat 
ttaaaatatt 
attctactat 
agtgttagag 
tatttcttat 
ttgccatggc 
aagaacaaag 
tagtcactgg 
acgtgcttaa 
ctttttcttc 
ttcttcactt 
gcatgttcca 
ttctttcatc 
actcatattc 



agttttggct 
attctggtat 
tatcacgctt 
gcataaaatt 
tatataattt 
tcatataaat 
tttatatttt 
agttgtttca 
caacacccag 
gtattttttt 
gtcatgttat 
cttttacatg 
ttcatatatc 
acttttttaa 
gtttaaattg 
tgttacccta 
tgcttttacc 
actctgtggt 
aaaaaagaca 
ctgatcaaga 
catgcacttt 
atcatcacca 
caacacacgt 
accaccttct 
atccatccat 
aatactactc 



ctctcgccgg 60 
cattctcact 120 
ccgcacacga 180 
ttatgaagtc 240 
acttaatttt 300 
gtttaatata 3 60 
aaaaatatat 420 
aaagtaataa 4 80 
tgacacaact 540 
tatacaatga 600 
gcaaaattct 660 
cgagacacat 720 
aaatactcaa 780 
ttttaagttt 840 
ttttatagat 900 
aaccataaac 960 
ttttcttggt 1020 
cttttggttc 1080 
aaacagagag 114 0 
tcgccgcgtc 1200 
aaatggctca 1260 
caaccacctg 1320 
caacctgcat 1380 
ctcttatata 1440 
ccagagtact 1500 
tact atg 1557 
Met 
1 



gcg gat aca get aga gga acc cat cac gat ate ate ggc aga gac cag 
Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp Gin 

5 10 15 

tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec gga cga 
Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg 
20 ' 25 30 

gga tct gac tac tec aag tct agg cag att get aaa get gca act get 
Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lye Ala Ala Thr Ala 
35 " 40 45 

gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt acc ctt gtt 
Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val 
50 ~ 55 60 65 

gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt ate ttc 
Gly Thr Val lie Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe 

70 75 80 

age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc ate acc 
Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr 

85 90 95 

ggt ttt ctt tec tct gga ggg ttt ggc att gec get ata acc gtt ttc 
Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val Phe 
100 105 HO 

tct tgg att tac aa gtaagcacac atttatcatc ttacttcata attttgtgca 
Ser Trp He Tyr Lys 
115 



1605 



1653 



1701 



1749 



1797 



1845 



1893 



1947 



atatgtgcat gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa 2007 
atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg 2067 
acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga 2127 
tacctattga ttgtgaatag g tac gca acg gga gag cac cca cag gga tea 217 8 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser 
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120 125 

gac aag ttg gac agt gca agg atg aag ttg gga age aaa get cag gat 2226 
Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp 
130 135 140 

ctg aaa gac aga get cag tac tac gga cag caa cat act ggt ggg gaa 22 74 
Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu 
145 , 150 155 160 

cat gac cgt gac cgt act cgt ggt ggc cag cac act acc atg get teg 23 22 
His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Met Ala Ser 

165 170 175 

gaa gaa gga caa gtg ate gee tgc cac acc gtt gag aca tgg aac gag 2370 
Glu Glu Gly Gin Val lie Ala Cys His Thr Val Glu Thr Trp Asn Glu 

180 185 190 

cag ctt cag aag get aat gaa tec aaa act ctt gtg gtg gtt gat ttc 2418 
Gin Leu Gin Lys Ala Asn Glu Ser Lys Thr Leu Val Val Val Asp Phe 
195 200 205 

acg get tct tgg tgt gga cca tgt cgt ttc ate get cca ttc ttt get 2466 
Thr Ala Ser Trp Cys Gly Pro Cys Arg Phe Xle Ala Pro Phe Phe Ala 
210 * 215 220 

gat ttg get aag aaa ctt cct aac gtg ctt ttc etc aag gtt gat act 2514 
Asp Leu Ala Lys Lys Leu Pro Asn Val Leu Phe Leu Lys Val Asp Thr 
225 230 235 240 

gat gaa ttg aag teg gtg gca agt gat tgg gcg ata cag gcg atg cca 2562 
Asp Glu Leu Lys Ser Val Ala Ser Asp Trp Ala lie Gin Ala Met Pro 

245 250 255 

acc ttc atg ttt ttg aag gaa ggg aag att ttg gac aaa gtt gtt gga 2610 
Thr Phe Met Phe Leu Lys Glu Gly Lys lie Leu Asp Lys Val Val Gly 

260 * 265 270 



gee aag aaa gat gag ctt cag tct acc att gee aaa cac ttg get taa 2658 
Ala Lys Lys Asp Glu Leu Gin Ser Thr lie Ala Lys His Leu Ala * 
275 280 285 

g cttaataagt atgaactaaa atgcatgtag gtgtaagagc tcatggagag 2709 
catggaatat tgtatccgac catgtaacag tataataact gagctccatc tcacttcttc 2769 
tatgaataaa caaaggatgt tatgatatat taacactcta tctatgcacc ttattgttct 2 82 9 
atgataaatt tcctcttatt attataaatc atctgaatcg tgaeggctta tggaatgett 2 889 
caaatagtac aaaaacaaat gtgtactata agactttcta aacaattcta actttagcat 2949 
tgtgaacgag acataagtgt taagaagaca taacaattat aatggaagaa gtttgtctcc 3 009 
atttatatat tatatattac ccacttatgt attatattag gatgttaagg agacataaca 3069 
attataaaga gagaagtttg tatccattta tatattatat actacccatt tatatattat 3129 
acttatccac ttatttaatg tctttataag gtttgatcca tgatatttct aatattttag 3189 
ttgatatgta tatgaaaggg tactatttga actctcttac tctgtataaa ggttggatca 3249 
tccttaaagt gggtctattt aattttattg cttcttacag ataaaaaaaa aattatgagt 3309 
tggtttgata aaatattgaa ggatttaaaa taataataaa taataaataa catataatat 33 69 
atgtatataa atttattata atataacatt tatctataaa aaagtaaata ttgtcataaa 3429 
tctatacaat cgtttagcct tgctggacga ctctcaatta tttaaacgag agtaaacata 3489 
tttgactttt tggttattta acaaattatt atttaacact atatgaaatt tttttttttt 3549 
ateggcaagg aaataaaatt aaattaggag ggacaatggt gtgtcccaat ccttatacaa 3609 
ccaacttcca caggaaggtc aggtegggga caacaaaaaa acaggcaagg gaaatttttt 3 6 69 
aatttgggtt gtcttgtttg ctgeataatt tatgeagtaa aacactacac ataacccttt 3729 
tagcagtaga gcaatggttg accgtgtgct tagcttcttt tattttattt ttttatcagc 3789 
aaagaataaa taaaataaaa tgagacactt cagggatgtt tcaaccctta tacaaaaccc 3849 
caaaaacaag tttcctagca ccctaccaac taaggtacc 3888 



<210> 17 
<211> 118 
<212> PRT 
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<400> 17 




I JC I— 

i 




Sen 


inr 


J. 

Gin 


Tyr 


Pro 


Met 








20 


Arg 


Gly 


Ser 


Asp 






35 




Ala 


Val 


Thr 


Ala 




50 






Val 


Gly 


Thr 


Val 


65 








Phe 


Ser 


Pro 


lie 


Thr 


Gly 


Phe 


Leu 








100 


Phe 


Ser 


Trp 


lie 






115 





Ala Arg Gly Thr 
5 

Met Gly Arg Asp 

Tyr Ser Lys Ser 

40 

Gly Gly Ser Leu 
55 

lie Ala Leu Thr 
70 

Leu Val Pro Ala 
85 

Ser Ser Gly Gly 
Tyr Lys 



His His Asp lie 
10 

Arg Asp Gin Tyr 
25 

Arg Gin lie Ala 

Leu Val Leu Ser 

60 

Val Ala Thr Pro 
75 

Leu lie Thr Val 
90 

Phe Gly He Ala 
105 



He Gly Arg Asp 
15 

Gin Met Ser Gly 
30 

Lys Ala Ala Thr 
45 

Ser Leu Thr Leu 

Leu Leu Val He 

80 

Ala Leu Leu He 
95 

Ala He Thr Val 
110 



<210> 18 
<211> 169 
<212> PRT 
<213> Artifcial 

<400> 18 
Tyr Ala Thr Gly 
1 

Arg Met Lys Leu 

2 0 

Tyr Tyr Gly Gin 
35 

Arg Gly Gly Gin 
50 

Ala Cys His Thr 
65 

Glu Ser Lys Thr 

Pro Cys Arg Phe 

100 

Pro Asn Val Leu 
115 

Ala Ser Asp Trp 
130 

Glu Gly Lys He 
145 

Gin Ser Thr He 



sequence 



Glu His Pro Gin 
5 

Gly Ser Lys Ala 

Gin His Thr Gly 

40 

His Thr Thr Met 
55 

Val Glu Thr Trp 
70 

Leu Val Val Val 
85 

He Ala Pro Phe 

Phe Leu ' Lys Val 

120 

Ala He Gin Ala 
135 

Leu Asp Lys Val 
150 

Ala Lys His Leu 
165 



Gly Ser Asp Lys 
10 

Gin Asp Leu Lys 
25 

Gly Glu His Asp 

Ala Ser Glu Glu 

60 

Asn Glu Gin Leu 
75 

Asp Phe Thr Ala 
90 

Phe Ala Asp Leu 
105 

Asp Thr Asp Glu 

Met Pro Thr Phe 

140 

Val Gly Ala Lys 
155 

Ala 



Leu Asp Ser Ala 
15 

Asp Arg Ala Gin 
30 

Arg Asp Arg Thr 
45 

Gly Gin Val He 

Gin Lys Ala Asn 

8 0 

Ser Trp Cys Gly 
95 

Ala Lys Lys Leu 
110 

Leu Lys Ser Val 
125 

Met Phe Leu Lys 

Lys Asp Glu Leu 

160 



<210> 19 
<211> 3888 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<221> CDS 

<222> (1555) . . . (2249) 
<221> CDS 

<222> (2490) . . . (2658) 
<400> 19 , 

ctgcaggaat tcattgtact cccagtatca ttatagtgaa agttttggct ctctcgccgg 60 
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tggtttttta 
ttacttgtta 
tatccctaca 
ccgtctatct 
agcgttggta 
caatataaac 
ttatcaaata 
aatttaactc 
agccattttt 
aaatttcacc 
ataattccca 
cttctaaagt 
tattacttct 
aattgttgaa 
agtttaaagt 
tataagattt 
atgtaagtcc 
atgcatggat 
acaaaacgca 
catgtatgtc 
cccatctcaa 
tatatattca 
atgcgtgtca 
atacctataa 
actactctac 



cctctattta 
ctttaatttc 
aatttattat 
ttaatgtagt 
gaaagcataa 
aaattcttta 
tttttcaacc 
cataattttt 
ttctttgaat 
aaacaatcat 
tttgacacta 
aattttaata 
aaaaaattaa 
tttgtgacta 
aaatataagt 
atggtggact 
gtaactggaa 
gcttgcgcaa 
atcacacaac 
taaatgccat 
cccacacaca 
ttctcttccg 
tcccatgccc 
atacctctaa 
tactataata 



aaggggtttt 
tcataatctt 
ttgttaaaca 
ctaacatttt 
tgatttattc 
ccttaagaag 
acgtaaatct 
ttattcgact 
aaaaaaatcc 
ttgtggtatt 
cggaagtaac 
atagttacta 
ttagatataa 
ttgatttatt 
aatgtagtag 
aattttcata 
ttactgtggg 
gaaaaagaca 
caactcaaat 
gcaaagcaac 
aacacattgc 
ccacctcaat 
aaatctccat 
tatcactcac 
ccccaaccca 



ccacctaaaa 
tggttgaaat 
ttttcaaafcc 
catattgaaa 
ttattcttct 
gatttcccat 
cataataata 
gatcttaaag 
aattatcatt 
tctgaagcaa 
tgaagatctg 
tattcaagat 
ttaaaatatt 
attctactat 
agtgttagag 
tatttcttat 
ttgccatggc 
aagaacaaag 
tagtcactgg 
acgtgcttaa 
ctttttcttc 
ttcttcactt 
gcatgttcca 
ttctttcatc 
actcatattc 



attctggtat 
tatcacgctt 
gcataaaatt 
tatataattt 
tcatataaat 
tttatatttt 
agttgtttca 
caacacccag 
gtattttttt 
gtcatgttat 
cttttacatg 
ttcatatatc 
acttttttaa 
gtttaaattg 
tgttacccta 
tgcttttacc 
actctgtggt 
aaaaaagaca 
ctgatcaaga 
catgcacttt 
atcatcacca 
caacacacgt 
accaccttct 
atccatccat 
aatactactc 



cattctcact 12 0 
ccgcacacga 18 0 
ttatgaagtc 240 
acttaatttt 300 
gtttaatata 360 
aaaaatatat 42 0 
aaagtaataa 480 
tgacacaact 54 0 
tatacaatga 60 0 
gcaaaattct 66 0 
cgagacacat 72 0 
aaatactcaa 780 
ttttaagttt 840 
ttttatagat 900 
aaccataaac 960 
ttttcttggt 1020 
cttttggttc 1080 
aaacagagag 1140 
tcgccgcgtc 1200 
aaatggctca 1260 
caaccacctg 1320 
caacctgcat 13 8 0 
ctcttatata 1440 
ccagagtact 15 0 0 
tact atg 1557 
Met 
1 



get teg gaa gaa gga caa gtg ate gec tgc cac acc gtt gag aca tgg 16 05 

Ala Ser Glu Glu Gly Gin Val He Ala Cys His Thr Val Glu Thr Trp 

5 10 15 

aac gag cag ctt cag aag get aat gaa tec aaa act ctt gtg gtg gtt 1653 

Asn Glu Gin Leu Gin Lys Ala Asn Glu Ser Lys Thr Leu Val Val Val 

20 25 30 

gat ttc acg get tct tgg tgt gga cca tgt cgt tte ate get cca ttc 

Asp Phe Thr Ala Ser Trp Cys Gly Pro Cys Arg Phe lie Ala Pro Phe 

35 40 45 

ttt get gat ttg get aag aaa ctt cct aac gtg ctt ttc etc aag gtt 

Phe Ala Asp Leu Ala Lys Lys Leu Pro Asn Val Leu Phe Leu Lys Val 

50 55 60 65 

gat act gat gaa ttg aag teg gtg gca agt gat tgg gcg ata cag gcg 1797 

Asp Thr Asp Glu Leu Lys Ser Val Ala Ser Asp Trp Ala He Gin Ala 

70 75 BO 



1701 



1749 



atg cca acc ttc atg ttt ttg aag gaa ggg aag att ttg gac aaa gtt 

Met Pro Thr Phe Met Phe Leu Lys Glu Gly Lys He Leu Asp Lys Val 

85 " 90 95 

gtt gga gec aag aaa gat gag ctt cag tct acc att gec aaa cac ttg 

Val Gly Ala Lys Lys Asp Glu Leu Gin Ser Thr He Ala Lys His Leu 

100 " 105 110 

get atg gcg gat aca get aga gga acc cat cac gat ate ate ggc aga 

Ala Met Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg 
115 120 ~ 125 

gac cag tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec 

Asp Gin Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser 

130 A 135 140 145 

gga cga gga tct gac tac tec aag tct agg cag att get aaa get gca 

Gly Arg Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala 

150 155 , 160 



1845 



1893 



1941 



1989 



2037 
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act get gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt ace 2085 

Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr 

165 " 170 175 

ctt gtt gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt 2133 

Leu Val Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val 

180 185 190 

ate ttc age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc 2181 

He Phe Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu 

195 200 205 



ate acc ggt ttt ctt tec tct gga ggg ttt ggc att gec get ata ace 
He Thr Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr 
210 1 215 220 225 



2229 



gtt ttc tct tgg att tac aa gtaagcacac atttatcatc ttacttcata 2279 
Val Phe Ser Trp He Tyr Lys 

230 

attttgtgca atatgtgcat gcatgtgttg agecagtage tttggatcaa tttttttggt 2339 
cgaataacaa atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat 23 99 
acgaaatttg acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg 2459 
cttggtatga tacctattga ttgtgaatag g tac gca acg gga gag cac cca 2511 

Tyr Ala Thr Gly Glu His Pro 
235 

cag gga tea gac aag ttg gac agt gca agg atg aag ttg gga age aaa 2 559 
Gin Gly Ser Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys 
240 245 250 255 

get cag gat ctg aaa gac aga get cag tac tac gga cag caa cat act 
Ala Gin Asp Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr 

260 265 270 



2607 



ggt ggg gaa cat gac cgt gac cgt act cgt ggt ggc cag cac act act 2655 
Gly Gly Glu His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr 

275 280 285 

taa gcttaataag tatgaactaa aatgcatgta ggtgtaagag ctcatggaga 2708 
* 



gcatggaata 
ctatgaataa 
tatgataaat 
tcaaatagta 
ttgtgaacga 
catttatata 
aattataaag 
tacttatcca 
gttgatatgt 
atccttaaag 
ttggtttgat 
tatgtatata 
atctatacaa 
atttgacttt 
tateggcaag 
accaacttcc 
taatttgggt 
ttagcagtag 
caaagaataa 
ccaaaaacaa 

<210> 20 
<211> 232 



ttgtatccga 
acaaaggatg 
ttcctcttat 
caaaaacaaa 
gacataagtg 
ttatatatta 
agagaagttt 
cttatttaat 
atatgaaagg 
tgggtctatt 
aaaatattga 
aatttattat 
tegtttagee 
ttggttattt 
gaaataaaat 
acaggaaggt 
tgtcttgttt 
agcaatggtt 
ataaaataaa 
gtttcctagc 



ccatgtaaca 
ttatgatata 
tattataaat 
tgtgtactat 
ttaagaagac 
cccacttatg 
gtatccattt 
gtctttataa 
gtactatttg 
taattttatt 
aggatttaaa 
aatataacat 
ttgctggacg 
aacaaattat 
taaattagga 
caggtegggg 
getgeataat 
gaccgtgtgc 
atgagacact 
accctaccaa 



gtataataac 
ttaacactct 
catctgaatc 
aagactttct 
ataacaatta 
tattatatta 
atatattata 
ggttcgatcc 
aactctctta 
gcttcttaca 
ataataataa 
ttatctataa 
actctcaatt 
tatttaacac 
gggacaatgg 
acaacaaaaa 
ttatgcagta 
ttagcttctt 
tcagggatgt 
ctaaggtacc 



tgagctccat 
atetatgeae 
gtgaeggett 
aaacaattct 
taatggaaga 
ggatgttaag 
tactacccat 
atgatatttc 
ctctgtataa 
gataaaaaaa 
ataataaata 
aaaagtaaat 
atttaaacga 
tatatgaaat 
tgtgtcccaa 
aacaggcaag 
aaacactaca 
ttattttatt 
ttcaaccctt 



ctcacttctt 2768 
cttattgttc 2828 
atggaatgct 2 8 88 
aactttagca 2 94 8 
agtttgtctc 3008 
gagacataac 30 68 
ttatatatta 3128 
taatatttta 3188 
aggttggatc 324 8 
aaattatgag 33 08 
acatataata 3368 
attgtcataa 3428 
gagtaaacat 34 88 
tttttttttt 3548 
tccttataca 3608 
ggaaattttt 3 668 
cataaccctt 3728 
tttttatcag 3788 
atacaaaacc 3 848 

3888 
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<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Chimeric 



<400> 20 
Met Ala Ser Glu 
1 

Trp Asn Glu Gin 

20 

Val Asp Phe Thr 
35 

Phe Phe Ala Asp 
50 

Val Asp Thr Asp 
65 

Ala Met Pro Thr 

Val Val Gly Ala 

100 

Leu Ala Met Ala 
115 

Arg Asp Gin Tyr 
130 

Ser Gly Arg Gly 
145 

Ala Thr Ala Val 

Thr Leu Val Gly 

180 

Val lie Phe Ser 
195 

Leu lie Thr Gly 
210 

Thr Val Phe Ser 
225 



Glu 


Gly 


Gin 


Val 


5 

T - 

Leu 


tjj_n 


Lys 


Ala 


Ala 


Ser 


Trp 


Cys 








40 


Leu 


Ala 


Lys 


Lys 






55 




Glu 


Leu 


Lys 


Ser 




70 






Pne 


Met 


Phe 


Leu 


85 








Lys 


Lys 


Asp 


Glu 


Asp 


Thr 


Ala Arg 








120 


Pro 


Met 


Met 


Gly 






135 




Ser 


Asp 


Tyr 


Ser 




150 






Thr 


Ala 


Gly Gly 


165 








Thr 


Val 


He 


Ala 


Pro 


He 


Leu 


Val 








200 


Phe 


Leu 


Ser 


Ser 






215 




Trp 


He 


Tyr 


Lys 




230 







He Ala Cys His 
10 

Asn Glu Ser Lys 
25 

Gly Pro Cys Arg 

Leu Pro Asn Val 

60 

Val Ala Ser Asp 
75 

Lys Glu Gly Lys 
90 

Leu Gin Ser Thr 
105 

Gly Thr His His 

Arg Asp Arg Asp 

140 

Lys Ser Arg Gin 
155 

Ser Leu Leu Val 
170 

Leu Thr Val Ala 
185 

Pro Ala Leu He 

Gly Gly Phe Gly 

220 



Thr Val Glu Thr 
15 

Thr Leu Val Val 
30 

Phe He Ala Pro 
45 

Leu Phe Leu Lys 

Trp Ala He Gin 

80 

He Leu Asp Lys 
95 

He Ala Lys His 
110 

Asp He He Gly 
125 

Gin Tyr Gin Met 

lie Ala LyB Ala 

160 

Leu Ser Ser Leu - 
175 

Thr Pro Leu Leu 
190 

Thr Val Ala Leu 
205 

He Ala Ala He 



<210> 21 
<211> 55 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 21 

Tvr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lys Leu Asp Ser Ala 

1 5 10 15 

Arg Met Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp Arg Ala Gin 

20 * 25 30 

Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu His Asp Arg Asp Arg Thr 

35 40 45 

Arg Gly Gly Gin His Thr Thr 
50 55 



<210> 22 
<211> 3787 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<221> CDS 
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<222> (1555) 



(2556) 



<400> 22 

ctgcaggaat 

tggtttttta 

ttacttgtta 

tatccctaca 

ccgtctatct 

agcgttggta 

caatataaac 

ttatcaaata 

aatttaactc 

agccattttt 

aaatttcacc 

ataattccca 

cttctaaagt 

tattacttct 

aattgttgaa 

agtttaaagt 

tataagattt 

atgtaagtcc 

atgcatggat 

acaaaacgca 

catgtatgtc 

cccatctcaa 

tatatattca 

atgcgtgtca 

atacctataa 

actactctac 



tcattgtact 
cctctattta 
ctttaatttc 
aatttattat 
ttaatgtagt 
gaaagcataa 
aaattcttta 
tttttcaacc 
cataattttt 
ttctfctgaat 
aaacaatcat 
tttgacacta 
aattttaata 
aaaaaattaa 
tttgtgacta 
aaatataagt 
atggtggact 
gtaactggaa 
gcttgcgcaa 
atcacacaac 
taaatgccat 
cccacacaca 
ttctcttccg 
tcccatgccc 
atacctctaa 
tactataata 



cccagtatca 
aaggggtttt 
tcataatctt 
ttgttaaaca 
ctaacatttt 
tgatttattc 
ccttaagaag 
acgtaaatct 
ttattcgact 
aaaaaaatcc 
ttgtggtatt 
cggaagtaac 
atagttacta 
ttagatataa 
ttgatttatt 
aatgtagtag 
aattttcata 
ttactgtggg 
gaaaaagaca 
caactcaaat 
gcaaagcaac 
aacacattgc 
ccacctcaat 
aaatctccat 
tatcactcac 
ccccaaccca 



ttatagtgaa 
ccacctaaaa 
tggttgaaat 
ttttcaaacc 
catattgaaa 
ttattcttct 
gatttcccat 
cataataata 
gatcttaaag 
aattatcatt 
tctgaagcaa 
tgaagatctg 
tattcaagat 
ttaaaatatt 
attctactat 
agtgttagag 
tatttcttat 
ttgccatggc 
aagaacaaag 
tagtcactgg 
acgtgcttaa 
ctttttcttc 
ttcttcactt 
gcatgttcca 
ttctttcatc 
acbcatattc 



agttttggct 
attctggtat 
tatcacgctt 
gcataaaatt 
tatataattt 
tcatataaat 
tttatatttt 
agttgtttca 
caacacccag 
gtattttttt 
gtcatgttat 
cttttacatg 
ttcatatatc 
acttttttaa 
gtttaaattg 
tgttacccta 
tgcttttacc 
actctgtggt 
aaaaaagaca 
ctgatcaaga 
catgcacttt 
atcatcacca 
caacacacgt 
accaccttct 
atccatccat 
aatactactc 



ctctcgccgg 60 
cattctcact 120 
ccgcacacga 18 0 
ttatgaagtc 24 0 
acttaatttt 300 
gtttaatata 360 
aaaaatatat 42 0 
aaagtaataa 48 0 
tgacacaact 54 0 
tatacaatga 60 0 
gcaaaattct 66 0 
cgagacacat 720 
aaatactcaa 780 
ttttaagttt 840 
ttttatagat 900 
aaccataaac 960 
ttttcttggt 1020 
cttttggttc 1080 
aaacagagag 114 0 
tcgccgcgtc 12 0 0 
aaatggctca 12 60 
caaccacctg 1320 
caacctgcat 13 8 0 
ctcttatata 1440 
ccagagtact 1500 
tact atg 1557 
Met 
1 



aat ggt etc gaa act cac aac aca agg etc tgt ate gta gga agt ggc 
Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser Gly 

5 10 15 



1605 



cca gcg gca cac acg gcg gcg att tac gca get agg get gaa ctt aaa 
Pro Ala Ala Hie Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu Lys 
20 25 30 



1653 



cct ctt etc ttc gaa gga tgg atg get aac gac ate get ccc ggt ggt 
Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly Gly 
35 * 40 45 



1701 



caa eta aca acc acc acc gac gtc gag aat ttc ccc gga ttt cca gaa 
Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro Glu 
50 55 60 65 



1749 



ggt att etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag 17 97 

Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu 

70 75 80 

cga ttc ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat ttc 1845 

Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe 

85 90 95 



tct teg aaa ccg ttt aag eta ttc aca gat tea aaa gee att etc get 
Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala 
100 ~ 105 110 



1893 



gac get gtg att etc get act gga get gtg get aag egg ctt age ttc 

Asp Ala Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser Phe 

115 120 125 

gtt gga tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate tec 

Val Gly Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He Ser 
130 ~ 135 140 145 



1941 



1989 
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get tgt get gtt tgc gac gga get get ccg ata ttc cgt aac aaa cct 203 7 
Ala Cys Ala Val Cys Asp Gly Ala Ala Pro lie Phe Arg Asn Lys Pro 

150 155 160 

ctt gcg gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac ttt 2 085 
Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe 

165 170 175 

ctt aca aaa tat gga tct aaa gtg tat ata ate cat agg aga gat get 213 3 
Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp Ala 
180 ^ 185 190 

ttt aga gcg tct aag att atg cag cag cga get ttg tct aat cct aag 2181 
Phe Arg Ala Ser Lys lie Met Gin Gin Arg Ala Leu Ser Asn Pro Lys 
195 200 205 

att gat gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga 222 9 
lie Asp Val lie Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly 
210 215 220 225 

gaa aga gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc gga 2277 
Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly 

230 235 240 

gat gtt tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat 2325 
Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His 

245 250 255 

gag cca get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat 2373 
Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp 
260 265 270 

ggt tat gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga 2421 
Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro Gly 
275 2B0 285 

gtt ttc get gcg ggt gat gtt cag gat aag aag tat agg caa gec ate 24 69 
Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala He 
290 ' 295 300 305 

act get gca gga act ggg tgc atg gca get ttg gat gca gag cat tac 2517 
Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr 

310 315 320 

tta caa gag att gga tct cag caa ggt aag agt gat tga agcttaataa 2566 
Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp * 

325 330 

gtatgaacta aaatgcatgt aggtgtaaga gctcatggag agcatggaat attgtatccg 2 626 
accatgtaac agtataataa ctgagctcca tctcacttct tctatgaata aacaaaggat 2686 
gttatgatat attaacactc tatctatgea ccttattgtt ctatgataaa tttcctctta 2746 
ttattataaa tcatctgaat cgtgacggct tatggaatgc ttcaaatagt acaaaaacaa 2 8 06 
atgtgtacta taagactttc taaacaattc taactttagc attgtgaacg agacataagt 2 8 66 
gttaagaaga cataacaatt ataatggaag aagtttgtct ccatttatat attatatatt 2 926 
acccacttat gtattatatt aggatgttaa ggagacataa caattataaa gagagaagtt 2 9 86 
tgtatccatt tatatattat atactaccca tttatatatt atacttatcc acttatttaa 3046 
tgtctttata aggtttgatc catgatattt ctaatatttt agttgatatg tatatgaaag 3106 
ggtactattt gaactctctt actctgtata aaggttggat catccttaaa gtgggtctat 3166 
ttaattttat tgettcttae agataaaaaa aaaattatga gttggtttga taaaatattg 3 226 
aaggatttaa aataataata aataataaat aacatataat atatgtatat aaatttatta 32 86 
taatataaca tttatctata aaaaagtaaa tattgtcata aatctataca ategtttage 3 3 46 
ettgetggae gactctcaat tatttaaacg agagtaaaca tatttgactt tttggttatt 340 6 
taacaaatta ttatttaaca ctatatgaaa tttttttttt ttatcggcaa ggaaataaaa 3466 
ttaaattagg agggacaatg gtgtgtccca atccttatac aaccaacttc cacaggaagg 352 6 
teaggteggg gacaacaaaa aaacaggcaa gggaaatttt ttaatttggg ttgtcttgtt 3586 
tgctgcataa tttatgcagt aaaacactac acataaccct tttagcagta gagcaatggt 3646 
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tgaccgtgtg cttagcttct tttattttat ttttttatca gcaaagaata aataaaataa 3706 
aatgagacac ttcagggatg tttcaaccct tatacaaaac cccaaaaaca agtttcctag 3766 
caccctacca actaaggtac c 3787 

<210> 23 
<211> 333 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Chimeric 



<400> 23 
Met Asn Gly Leu 
1 

Gly Pro Ala Ala 

20 

Lys Pro Leu Leu 
35 

Gly Gin Leu Thr 
50 

Glu Gly lie Leu 
65 

Glu Arg Phe Gly 

Phe Ser Ser Lys 

100 

Ala Asp Ala Val 
115 

Phe Val Gly Ser 
130 

Ser Ala Cys Ala 
145 

Pro Leu Ala Val 

Phe Leu Thr Lys 

180 

Ala Phe Arg Ala 
195 

Lys lie Asp Val 
210 

Gly Glu Arg Asp 
225 

Gly Asp Val Ser 

His Glu Pro Ala 

260 

Asp Gly Tyr Val 
275 

Gly Val Phe Ala 
290 

lie Thr Ala Ala 
305 

Tyr Leu Gin Glu 



Glu 


Thr 


His 


Asn 


5 








His 


Thr 


Ala 


Ala 


Phe 


Glu 


Gly 


Trp 








40 


Thr 


Thr 


Thr 


ASp 






55 




Gly 


Val 


Glu 


Leu 




70 






Thr 


_r_ jy 

Thr 


He 


Phe 


85 








Pro 


Phe 


Lys 


Leu 


lie 


Leu 


Ala 


Thr 








120 


Gly 


Glu 


Gly 


Ser 






135 




Val 


Cys 


Asp 


Gly 




150 






lie 


Gly 


Gly 


Gly 


165 








Tyr 


Gly 


Ser 


Lys 


Ser 


Lys 


lie 


Met 








200 


He 


Trp 


Asn 


Ser 






215 




Val 


Leu 


Gly 


Gly 




230 






Asp 


Leu 


Lys 


Val 


245 








Thr 


Lys 


Phe 


Leu 


Val 


Thr 


Lys 


Pro 








280 


Ala 


Gly 


Asp 


Val 






295 




Gly 


Thr 


Gly 


Cys 




310 






He 


Gly 


Ser 


Gin 



Thr Arg Leu Cys 
10 

He Tyr Ala Ala 
25 

Met Ala Asn Asp 

Val Glu Asn Phe 

60 

Thr Asp Lys Phe 
75 

Thr Glu Thr Val 
90 

Phe Thr Asp Ser 
105 

Gly Ala Val Ala 

Gly Gly Phe Trp 

140 

Ala Ala Pro He 
155 

Asp Ser Ala Met 
170 

Val Tyr He He 
185 

Gin Gin Arg - Ala 

Ser Val Val Glu 

220 

Leu Lys Val Lys 
235 

Ser Gly Leu Phe 
250 

Asp Gly Gly Val 
265 

Gly Thr Thr Gin 

Gin Asp Lys Lys 

300 

Met Ala Ala Leu 
315 

Gin Gly Lys Ser 
330 



He Val Gly Ser 
15 

Arg Ala Glu Leu 
30 

He Ala Pro Gly 
45 

Pro Gly Phe Pro 

Arg Lys Gin Ser 

80 

Thr Lys Val Asp 
95 

Lys Ala He Leu 
110 

Lys Arg Leu Ser 
125 

Asn Arg Gly He 

Phe Arg -Asn Lys 

160 

Glu Glu Ala Asn 
175 

His Arg Arg Asp 
190 

Leu Ser Asn Pro 
205 

Ala Tyr Gly Asp 

Asn Val Val Thr 

240 

Phe Ala He Gly 
255 

Glu Leu Asp Ser 
270 

Thr Ser Val Pro 
285 

Tyr Arg Gin Ala 

Asp Ala Glu His 

320 

Asp 



<210> 24 
<211> 4546 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1555) . . . (1907) 

<221> CDS 
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<222> (2148) . . . (3315) 
<223> Chimeric 



<400> 24 

ctgcaggaat 

tggtttttta 

ttacttgtta 

tatccctaca 

ccgtctatct 

agcgttggta 

caatataaac 

ttatcaaata 

aatttaactc 

agccattttt 

aaatttcacc 

ataattccca 

cttctaaagt 

tattacttct 

aattgttgaa 

agtttaaagt 

tataagattt 

atgtaagtcc 

atgcatggat 

acaaaacgca 

catgtatgtc 

cccatctcaa 

tatatattca 

atgcgtgtca 

atacctataa 

actactctac 



tcattgtact 
cctctattta 
ctttaatttc 
aatttattat 
ttaatgtagt 
gaaagcataa 
aaattcttta 
tttttcaacc 
cataattttt 
ttctttgaat 
aaacaatcat 
tttgacacta 
aattttaata 
aaaaaattaa 
tttgtgacta 
aaatataagt 
afcggtggact 
gtaactggaa 
gcttgcgcaa 
atcacacaac 
taaatgccat 
cccacacaca 
ttctcttccg 
tcccatgccc 
atacctctaa 
tactataata 



cccagtatca 
aaggggtttt 
tcataatctt 
ttgttaaaca 
ctaacatttt 
tgatttattc 
ccttaagaag 
acgtaaatct 
ttattcgact 
aaaaaaatcc 
ttgtggtatt 
cggaagtaac 
atagttacta 
ttagatataa 
ttgatttatt 
aatgtagtag 
aattttcata 
ttactgtggg 
gaaaaagaca 
caactcaaat 
gcaaagcaac 
aacacattgc 
ccacctcaat 
aaatctccat 
tatcactcac 
ccccaaccca 



ttatagtgaa 
ccacctaaaa 
tggttgaaat 
ttttcaaacc 
catattgaaa 
ttattcttct 
gatttcccat 
cataataata 
gatcttaaag 
aattatcatt 
tctgaagcaa 
tgaagatctg 
tattcaagat 
ttaaaatatt 
attctactat 
agtgttagag 
tatttcttat 
ttgccatggc 
aagaacaaag 
tagtcactgg 
acgtgcttaa 
ctttttcttc 
ttcttcactt 
gcatgttcca 
ttctttcatc 
actcatattc 



agttttggct 
attctggtat 
tatcacgctt 
gcataaaatt 
tatataattt 
tcatataaat 
tttatatttt 
agttgtttca 
caacacccag 
gtattttttt 
gtcatgttat 
cttttacatg 
ttcatatatc 
acttttttaa 
gtttaaattg 
tgttacccta 
tgcttttacc 
actctgtggt 
aaaaaagaca 
ctgatcaaga 
catgcacttt 
atcatcacca 
caacacacgt 
accaccttct 
atccatccat 
aatactactc 



ctctcgccgg 
cattctcact 
ccgcacacga 
ttatgaagtc 
acttaatttt 
gtttaatata 
aaaaatatat 
aaagtaataa 
tgacacaact 
tatacaatga 
gcaaaattct 
cgagacacat 
aaatactcaa 
ttttaagttt 
ttttatagat 
aaccataaac 
ttttcttggt 
cttttggttc 
aaacagagag 
tcgccgcgtc 
aaatggctca 
caaccacctg 
caacctgcat 
ctcttatata 
ccagagtact 
tact atg 
Met 
1 



gcg gat aca get aga gga acc cat cac gat ate ate ggc aga gac cag 

Ala Asp Thr Ala Arg Gly Thr His His Asp lie He Gly Arg Asp Gin 

5 10 15 

tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec gga cga 

Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg 

20 25 30 

gga tct gac tac tec aag tct agg cag att get aaa get gca act get 

Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr Ala 

3 5 *" 4 0 45 

gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt acc ctt gtt 

Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val 

50 55 60 65 

gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt ate ttc 

Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe 

70 75 80 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

102 0 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1557 



1605 



1653 



1701 



1749 



1797 



age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc ate acc 1845 

Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr 

85 90 95 

ggt ttt ctt tec tct gga ggg ttt ggc att gec get ata acc gtt ttc 1893 

Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val Phe 
100 105 110 



tct tgg att tac aa gtaagcacac atttatcatc ttacttcata attttgtgca 
Ser Trp He Tyr Lys 
115 



1947 



atatgtgcat gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa 2007 
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atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg 20 67 
acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga 2127 
tacctattga ttgtgaatag g tac gca acg gga gag cac cca cag gga tea 217 8 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser 
120 125 

gac aag ttg gac agt gca agg atg aag ttg gga age aaa get cag gat 22 26 
Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp 
130 135 140 

ctg aaa gae aga get cag tac tac gga cag caa cat act ggt ggg gaa 2274 
Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu 
145 150 155 160 

cat gac cgt gac cgt act cgt ggt ggc cag cac act acc atg aat ggt 2322 
His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Met Asn Gly 

165 170 175 

etc gaa act cac aac aca agg etc tgt ate gta gga agt ggc cca gcg 23 70 
Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser Gly Pro Ala 

180 185 190 

gca cac acg gcg gcg att tac gca get agg get gaa ctt aaa cct ctt 2418 
Ala His Thr Ala Ala lie Tyr Ala Ala Arg Ala Glu Leu Lys Pro Leu 
195 200 ' 205 

etc ttc gaa gga tgg atg get aac gac ate get ccc ggt ggt caa eta 24 66 
Leu Phe Glu Gly Trp Met Ala Asn Asp lie Ala Pro Gly Gly Gin Leu 
210 215 220 

aca acc acc acc gac gtc gag aat ttc ccc gga ttt cca gaa ggt att 2514 
Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro Glu Gly He 
225 230 235 240 

etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag cga ttc 2 5 62 
Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu Arg Phe 

245 ' ~ 250 255 

ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat ttc tct teg 2610 
Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe Ser Ser 

260 265 270 



aaa ccg ttt aag eta ttc aca gat tea aaa gec att etc get gac get 
Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala Asp Ala 
275 280 " 285 



2658 



gtg att etc get act gga get gtg get aag egg ctt age ttc gtt gga 270 6 

Val lie Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser Phe Val Gly 
290 295 300 

tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate tec get tgt 2754 

Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He Ser Ala Cys 
305 310 ~ 315 320 

get gtt tgc gac gga get get ccg ata ttc cgt aac aaa cct ctt gcg 2802 

Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys Pro Leu Ala 

325 330 335 

gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac ttt ctt aca 2 85 0 

Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe Leu Thr 

340 345 350 

aaa tat gga tct aaa gtg tat ata ate cat agg aga gat get ttt aga 2898 

Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp Ala Phe Arg 
355 360 365 

gcg tct aag att atg cag cag cga get ttg tct aat cct aag att gat 2 94 6 
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Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro Lys He Asp 

370 375 380 

gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga gaa aga 

Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly Glu Arg 

385 390 395 400 

gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc gga gat gtt 

Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly Asp Val 

405 410 415 

tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat gag cca 

Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His Glu Pro 

420 425 430 

get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat ggt tat 

Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp Gly Tyr 

435 440 445 

gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga gtt ttc 

Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro Gly Val Phe 

450 " 455 460 

get gcg ggt gat gtt cag gat aag aag tat agg caa gee ate act get 

Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala He Thr Ala 

465 470 • 475 480 

gca gga act ggg tgc atg gca get ttg gat gca gag cat tac tta caa 

Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr Leu Gin 

485 490 495 



2994 



3042 



3090 



3138 



3186 



3234 



3282 



gag 
Glu 



att 
He 



gga 
Gly 



tdt 
Ser 
500 



cag 
Gin 



caa 
Gin 



ggt 

Gly 



aag 

Lys 



agt 
Ser 
505 



gat 

Asp 



tga 
* 



agcttaataa gtatgaacta 3335 



aaatgcatgt 
agtataataa 
attaacactc 
tcatctgaat 
taagactttc 
cataacaatt 
gtattatatt 
tatatattat 
aggtttgatc 
gaactctctt 
tgettcttae 
aataataata 
tttatctata 
gactctcaat 
ttatttaaca 
agggacaatg 
gacaacaaaa 
tttatgcagt 
cttagcttct 
ttcagggatg 
actaaggtac 



aggtgtaaga 
c t gage t cca 
tatctatgea 
cgtgacggct 
taaacaattc 
ataatggaag 
aggatgttaa 
atactaccca 
catgatattt 
actctgtata 
agataaaaaa 
aataataaat 
aaaaagtaaa 
tatttaaacg 
ctatatgaaa 
gtgtgtccca 
aaacaggcaa 
aaaacactac 
tttattttat 
tttcaaccct 



gctcatggag 
tctcacttct 
ccttattgtt 
tatggaatgc 
taactttagc 
aagtttgtct 
ggagacataa 
tttatatatt 
ctaatatttt 
aaggttggat 
aaaattatga 
aacatataat 
tattgtcata 
agagtaaaca 
tttttttttt 
atccttatac 
gggaaatttt 
acataaccct 
ttttttatca 
tatacaaaac 



agcatggaat 

tctatgaata 

ctatgataaa 

ttcaaatagt 

attgtgaacg 

ccatttatat 

caattataaa 

atacttatcc 

agttgatatg 

catccttaaa 

gttggtttga 

atatgtatat 

aatctataca 

tatttgactt 

ttatcggcaa 

aaccaacttc 

ttaatttggg 

tttagcagta 

gcaaagaata 

cccaaaaaca 



attgtatccg 
aacaaaggat 
tttcctctta 
acaaaaacaa 
agacataagt 
attatatatt 
gagagaagtt 
acttatttaa 
tatatgaaag 
gtgggtctat 
taaaatattg 
aaatttatta 
ategtttage 
tttggttatt 
ggaaataaaa 
cacaggaagg 
ttgtcttgtt 
gage aat ggt 
aataaaataa 
agtttcctag 



accatgtaac 33 95 
gttatgatat 34 55 
ttattataaa 3515 
atgtgtacta 3575 
gttaagaaga 3 635 
acccacttat 3695 
tgtatccatt 3755 
tgtctttata 3815 
ggtactattt 3875 
ttaattttat 3935 
aaggatttaa 3995 
taatataaca 4055 
ettgetggae 4115 
taacaaatta 4175 
ttaaattagg 4235 
teaggteggg 42 95 
tgctgcataa 4355 
tgaccgtgtg 4415 
aatgagacac 4475 
caccctacca 4535 

4546 



<210> 25 

<211> 118 

<212> PRT 

<213> Artificial Sequence 



<220> 
<223> 



Chimeric 



<400> 25 

Met Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp 
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Ayr 


Jtr X. O 










Z, U 


Arg 


Gly 


Ser 


Asp 






35 




Ala 


Val 


Thr 


Ala 




50 






Val 


Gly 


Thr 


Val 


65 








Phe 


Ser 


Pro 


He 


Thr 


Gly 


Phe 


Leu 








100 


Phe 


Ser 


Trp 


He 






115 





5 

Met Gly Arg Asp 

Tyr Ser Lys Ser 

40 

Gly Gly Ser Leu 
55 

He Ala Leu Thr 
70 

Leu Val Pro Ala 
85 

Ser Ser Gly Gly 
Tyr Lys 



10 

Arg Asp Gin Tyr 
25 

Arg Gin He Ala 

Leu Val Leu Ser 

60 

Val Ala Thr Pro 
75 

Leu He Thr Val 
90 

Phe Gly He Ala 
105 



15 

Gin Met Ser Gly 
30 

Lys Ala Ala Thr 
45 

Ser Leu Thr Leu 

Leu Leu Val He 

80 

Ala Leu Leu He 
95 

Ala He Thr Val 
110 



<210> 26 
<211> 388 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 



<;400> 26 



1 1 1 — ■ - - 

Tyr 


Ala 


inr 


Gly 


Glu 


xIJlS 


Pro 


Gin 


vjiy 


O a "V* 




Lys 


Leu 


Asp 


Ser 


Ala 


j. 






5 










J. u 










15 




Arg 


Met 


Lys 


Leu 


Gly 


Ser 


Lys 


Ala 


Gin 


Asp 


lieu 


Lys 


Asp 


Arg 


Ala 


Gin 






20 




















30 






Tyr 


Tyr 


Gly 
35 


Gin 


Gin 


His 


Thr 


Gly 
40 


Gly 


Glu 


His 


Asp 


Arg 
45 


Asp 


Arg 


Thr 


Arg 


Gly 
50 


Gly 


Gin 


His 


Thr 


Thr 
55 


Met 


Asn 


Gly 


Leu 


Glu 
60 


Thr 


His 


Asn 


Thr 


Arg 


Leu 


Cys 


He 


Val 


Gly 


Ser 


Gly 


Pro 


Ala 


Ala 


His 


Thr 


Ala 


Ala 


He 


65 










70 










75 










80 


Tyr 


Ala 


Ala 


Arg 


Ala 


Glu 


Leu 


Lys 


Pro 


Leu 


Leu 


Phe 


Glu Gly 


Trp 


Met 






85 










90 










95 




Ala 


Asn 


Asp 


He 


Ala 


Pro 


Gly Gly 


Gin 


Leu 


Thr 


Thr 


Thr 


Thr 


Asp 


Val 






100 










105 










110 






Glu 


Asn 


Phe 


Pro 


Gly 


Phe 


Pro 


Glu 


Gly 


He 


Leu 


Gly Val 


Glu 


Leu 


Thr 






115 








120 










125 








Asp 


Lys 
130 


Phe 


Arg 


Lys 


Gin 


Ser 
135 


Glu 


Arg 


Phe 


Gly 


Thr 
140 


Thr 


He 


Phe 


Thr 


Glu 


Thr 


Val 


Thr 


Lys 


Val 


Asp 


Phe 


Ser 


Ser 


Lys 


Pro 


Phe 


Lys 


Leu 


Phe 


145 










150 










155 










160 


Thr 


Asp 


Ser 


Lya 


Ala 


He 


Leu 


Ala 


Asp 


Ala 


Val 


He 


Leu 


Ala 


Thr 


Gly 






165 










170 










175 




Ala 


Val 


Ala 


Lys 
180 


Arg 


Leu 


Ser 


Phe 


Val 
185 


Gly 


Ser 


Gly 


Glu 


Gly 
190 


Ser 


Gly 


Gly 


Phe 


Trp 


Asn Arg 


Gly 


He 


Ser 


Ala 


Cys 


Ala 


Val 


Cys 


Asp 


Gly Ala 






195 










200 










205 








Ala 


Pro 


He 


Phe 


Arg 


Asn 


Lys 


Pro 


Leu 


Ala 


Val 


He 


Gly Gly Gly Asp 




210 










215 










220 










Ser 


Ala 


Met 


Glu 


Glu 


Ala 


Asn 


Phe 


Leu 


Thr 


Lys 


Tyr 


Gly 


Ser 


Lys 


Val 


225 










230 










235 










240 


Tyr 


He 


He 


His 


Arg 
245 


Arg 


Asp 


Ala 


Phe 


Arg 
250 


Ala 


Ser 


Lys 


He 


Met 
255 


Gin 


Gin 


Arg 


Ala 


Leu 


Ser 


Asn 


Pro 


Lys 


He 


Asp 


Val 


He 


Trp Asn 


Ser 


Ser 








260 










265 










270 






Val 


Val 


Glu 


Ala 


Tyr 


Gly 


Asp 


Gly 


Glu 


Arg 


Asp 


Val 


Leu Gly 


Gly 


Leu 






275 










280 










285 








Lys 


Val 


Lys 


Asn 


Val 


Val 


Thr Gly 


Asp 


Val 


Ser 


Asp 


Leu 


Lys 


Val 


Ser 




290 










295 










300 










Gly 


Leu 


Phe 


Phe 


Ala 


He 


Gly His 


Glu 


Pro 


Ala 


Thr 


Lys 


Phe 


Leu 


Asp 


305 










310 










315 










320 


Gly 


Gly 


Val 


Glu 


Leu 


Asp 


Ser 


Asp 


Gly 


Tyr 


Val 


Val 


Thr 


Lys 


Pro 


Gly 
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325 

Thr Thr Gin Thr Ser Val Pro Gly 

340 

Asp Lys Lys Tyr Arg Gin Ala lie 
355 360 
Ala Ala Leu Asp Ala Glu His Tyr 

370 375 
Gly Lys Ser Asp 
385 



330 335 
Val Phe Ala Ala Gly Asp Val Gin 
345 350 
Thr Ala Ala Gly Thr Gly Cys Met 

365 

Leu Gin Glu lie Gly Ser Gin Gin 

380 



<210> 27 
<211> 4545 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chimeric 



<221> CDS 
<222> (1555) . 



. (2906) 



<221> CDS 

<222> (3147) . . . (3315) 



<400> 27 

ctgcaggaat 

tggtttttta 

ttacttgtta 

tatccctaca 

ccgtctatct 

agcgttggta 

caatataaac 

ttatcaaata 

aatttaactc 

agccattttt 

aaatttcacc 

ataattccca 

cttctaaagt 

tattacttct 

aattgttgaa 

agtttaaagt 

tataagattt 

atgtaagtcc 

atgcatggat 

acaaaacgca 

catgtatgtc 

cccatctcaa 

tatatattca 

atgcgtgtca 

atacctataa 

actactctac 



tcattgtact 

cctctattta 

ctttaatttc 

aatttattat 

ttaatgtagt 

gaaagcataa 

aaattcttta 

tttttcaacc 

cataattttt 

ttctttgaat 

aaacaatcat 

tttgacacta 

aattttaata 

aaaaaattaa 

tttgtgacta 

aaatataagt 

atggtggact 

gtaactggaa 

gcttgcgcaa 

atcacacaac 

taaatgccat 

cccacacaca 

ttctcttccg 

tcccatgccc 

atacctctaa 

tactataata 



cccagtatca 
aaggggtttt 
tcataatctt 
ttgttaaaca 
ctaacatttt 
tgatttattc 
ccttaagaag 
acgtaaatct 
ttattcgact 
aaaaaaatcc 
ttgtggtatt 
cggaagtaac 
atagttacta 
ttagatataa 
ttgatttatt 
aatgtagtag 
aattttcata 
ttactgtggg 
gaaaaagaca 
caactcaaat 
gcaaagcaac 
aacacattgc 
ccacctcaat 
aaatctccat 
tatcactcac 
ccccaaccca 



ttatagtgaa 
ccacctaaaa 
tggttgaaat 
ttttcaaacc 
catattgaaa 
ttattcttct 
gatttcccat 
cataataata 
gatcttaaag 
aattatcatt 
tctgaagcaa 
tgaagatctg 
tattcaagat 
ttaaaatatt 
attctactat 
agtgttagag 
tatttcttat 
ttgccatggc 
aagaacaaag 
tagtcactgg 
acgtgcttaa 
ctttttcttc 
ttcttcactt 
gcatgttcca 
ttctttcatc 
actcatattc 



agttttggct 

attctggtat 

tatcacgctt 

gcataaaat t 

tatataattt 

tcatataaat 

tttatatttt 

agttgtttca 

caacacccag 

gtattttttt 

gtcatgttat 

cttttacatg 

ttcatatatc 

acttttttaa 

gtttaaattg 

tgttacccta 

tgcttttacc 

actctgtggt 

aaaaaagaca 

ctgatcaaga 

catgcacttt 

atcatcacca 

caacacacgt 

accaccttct 

atccatccat 

aatactactc 



ctctcgccgg 
cattctcact 
ccgcacacga 
ttatgaagtc 
acttaatttt 
gtttaatata 
aaaaatatat 
aaagtaataa 
tgacacaact 
tatacaatga 
gcaaaattct 
cgagacacat 
aaatactcaa 
ttttaagttt 
ttttatagat 
aaccataaac 
ttttcttggt 
cttttggttc 
aaacagagag 
tcgccgcgtc 
aaatggctca 
caaccacctg 
caacctgcat 
ctcttatata 
ccagagtact 
tact atg 
Met 
1 



aat ggt etc gaa act cac aac aca agg etc tgt ate gta gga agt ggc 

Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser Gly 

5 10 15 

cca gcg gca cac acg gcg gcg att tac gca get agg get gaa ctt aaa 

Pro Ala Ala His Thr Ala Ala lie Tyr Ala Ala Arg Ala Glu Leu Lys 
20 25 30 

cct ctt etc ttc gaa gga tgg atg get aac gac ate get ccc ggt ggt 

Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp lie Ala Pro Gly Gly 
35 40 45 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1557 



1605 ' 



1653 



1701 



-22- 



WO 02/50289 



PCTYUS01/50240 



caa eta aca acc acc acc gac gtc gag aat ttc ccc gga ttt cca gaa 1749 
Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Fhe Pro Glu 
50 55 60 65 

ggt att etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag 1797 

Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu 

70 75 BO 

» 

cga ttc ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat ttc 1845 

Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe 

85 90 95 



tct teg aaa ccg ttt aag eta ttc aca gat tea aaa gee att etc get 
Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala 
100 105 HO 



gtt gga tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate tec 
Val Gly Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He Ser 
130 ^ "* 135 140 145 



gaa aga gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc gga 
Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly 

230 235 240 



1893 



gac get gtg att etc get act gga get gtg get aag egg ctt age ttc 1941 
Asp Ala Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser Phe 
115 120 125 



1989 



get tgt get gtt tgc gac gga get get ccg ata ttc cgt aac aaa cct 203 7 
Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys Pro 

150 155 160 

ctt gcg gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac ttt 
Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe 

165 170 175 



2085 



ctt aca aaa tat gga tct aaa gtg tat ata ate cat agg aga gat get 2133 

Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp Ala 
180 - ~ 185 19P 

ttt aga gcg tct aag att atg cag cag cga get ttg tct aat cct aag 2181 

Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro Lys 

195 200 205 

att gat gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga 2229 

He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly 

210 215 220 225 



2277 



gat gtt tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat 2325 
Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His 

245 " 250 255 

gag cca get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat 2373 
Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp 
260 265 270 

ggt tat gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga 2421 
Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro Gly 
275 280 285 

gtt ttc get gcg ggt gat gtt cag gat aag aag tat agg caa gee ate 2469 
Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala He 
290 295 300 305 

act get gca gga act ggg tgc atg gca get ttg gat gca gag cat tac 2517 
Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr 

310 315 320 

f 
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tta caa gag att gga tct cag caa ggt aag agt gat atg gcg gat aca 2565 

Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp Met Ala Asp Thr 

325 330 335 

get aga gga acc cat cac gat ate ate ggc aga gac cag tac ccg atg 2613 

Ala Arg Gly Thr His His Asp He He Gly Arg Asp Gin Tyr Pro Met 
340 345 350 

atg ggc cga gac cga gac cag tac cag atg tec gga cga gga tct gac 2 661 

Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg Gly Ser Asp 
355 ~ 360 365 

tac tec aag tct agg cag att get aaa get gca act get gtc aca get 2709 

Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr Ala Val Thr Ala 

370 ' 375 380 385 

ggt ggt tec etc ctt gtt etc tec age ctt acc ctt gtt gga act gtc 2757 

Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val Gly Thr Val 

390 395 400 



ata get ttg act gtt gca aca cct ctg etc gtt ate ttc age cca ate 
He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe Ser Pro He 

405 410 415 

ctt gtc ccg get etc ate aca gtt gca etc etc ate acc ggt ttt ctt 
Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr Gly Phe Leu 
420 425 430 



2805 



2853 



tec tct gga ggg ttt ggc att gee get ata acc gtt ttc tct tgg att 2901 
Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val Phe Ser Trp He 
43 5 440 445 

tac aa gtaagcacac atttatcatc ttacttcata attttgtgca atatgtgcat 2956 

Tyr Lys 

450 

gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa atgtaacaat 3 016 
aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg acctagctag 3076 
cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga tacctattga 3136 
ttgtgaatag g tac gca acg gga gag cac cca cag gga tea gac aag ttg 3186 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lys Leu 

455 46 0 

gac agt gca agg atg aag ttg gga age aaa get cag gat ctg aaa gac 3234 
Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp 
465 470 475 480 

aga get cag tac tac gga cag caa cat act ggt ggg gaa cat gac cgt 32 82 
Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu His Asp Arg 

485 490 495 

gac cgt act cgt ggt ggc cag cac act act taa gcttaataag tatgaactaa 3335 
Asp Arg Thr Arg Gly Gly Gin His Thr Thr * 

50 0 5 05 

aatgcatgta ggtgtaagag ctcatggaga gcatggaata ttgtatccga ccatgtaaca 3395 

gtataataac tgagctccat ctcacttctt ctatgaataa acaaaggatg ttatgatata 3455 

ttaacactct atetatgeae cttattgttc tatgataaat ttcctcttat tattataaat 3515 

catctgaatc gtgaeggett atggaatgct tcaaatagta caaaaacaaa tgtgtactat 3575 

aagactttct aaacaattct aactttagca ttgtgaacga gacataagtg ttaagaagac 3 63 5 

ataacaatta taatggaaga agtttgtctc catttatata ttatatatta cccacttatg 3695 

tattatatta ggatgttaag gagacataac aattataaag agagaagttt gtatccattt 3755 

atatattata tactacccat ttatatatta tacttatcca cttatttaat gtctttataa 3815 

ggtttgatcc atgatatttc taatatttta gttgatatgt atatgaaagg gtactatttg 3 875 

aactctctta ctctgtataa aggttggatc atccttaaag tgggtctatt taattttatt 3935 

gcttcttaca gataaaaaaa aaattatgag ttggtttgat aaaatattga aggatttaaa 3995 
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ataataataa ataataaata acatataata tatgtatata aatttattat aatataacat 4055 
ttatctataa aaaagtaaat attgtcataa atctatacaa tcgtttagcc ttgctggacg 4115 
actctcaatt atttaaacga gagtaaacat atttgacttt ttggttattt aacaaattat 4175 
tatttaacac tatatgaaat tttttttttt tatcggcaag gaaataaaat taaattagga 4235 
qggacaatgg tgtgtcccaa tccttataca accaacttcc acaggaaggt caggtcgggg 4295 
acaacaaaaa aacaggcaag ggaaattttt taatttgggt tgtcttgttt gctgcataat 4355 
ttatgcagta aaacactaca cataaccctt ttagcagtag agcaatggtt gaccgtgtgc 4415 
ttagcttctt ttattttatt tttttatcag caaagaataa ataaaataaa atgagacact 4475 
tcagggatgt ttcaaccctt atacaaaacc ccaaaaacaa gtttcctagc accctaccaa 4535 
c t aagg t ac c A " x 

<210> 28 
<211> 451 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 28 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser 

15 10 15 

Gly Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu 

20 25 30 

Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly 

35 40 45 

Gly Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro 

50 55 60 

Glu Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser 
65 70 75 BO 

Glu Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp 

85 90 95 

Phe Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu 

100 105 HO 

Ala Asp Ala Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser 

115 120 125 

Phe Val Gly Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He 

130 135 140 

Ser Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys 
145 150 155 160 

Pro Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn 

165 170 175 

Phe Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He Has Axg Arg Asp 

180 ^ 185 190 

Ala Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro 

195 200 205 

Lys He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp 

210 215 220 

Gly Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr 
225 230 235 240 

Gly Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly 

245 250 255 

His Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser 

260 265 270 

Asp Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro 

275 280 285 

Gly Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala 

290 295 300 

He Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His 
305 310 315 320 

Tyr Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp Met Ala Asp 

325 ~ 330 335 

Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp Gin Tyr Pro 

340 345 350 

Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg Gly Ser 

355 " 360 365 

Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr Ala Val Thr 
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370 

Ala Gly Gly Ser 
385 

Val lie Ala Leu 

lie Leu Val Pro 

420 

Leu Ser Ser Gly 
435 

lie Tyr Lys 
450 



375 

Leu Leu Val Leu 
390 

Thr Val Ala Thr 
405 

Ala Leu lie Thr 

Gly Phe Gly He 

440 



380 

Ser Ser Leu Thr 
3 95 

Pro Leu Leu Val 
410 

Val Ala Leu Leu 
425 

Ala Ala He Thr 



Leu Val Gly Thr 

400 

He Phe Ser Pro 
415 

He Thr Gly Phe 
430 

Val Phe Ser Trp 
445 



<210> 29 
<211> 55 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 29 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lya Leu Asp Ser Ala 

1 5 10 15 

Arg Met Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp Arg Ala Gin 

20 ~ 25 30 

Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu His Asp Arg Asp Arg Thr 

35 40 45 

Arg Gly Gly Gin His Thr Thr 
50 * 55 



<210> 30 

<211> 4922 

<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1555) . . . (1907) 
<221> CDS 

<222> (2148) . . . (3690) 
<223> Chimeric 
<400> 30 

ctgcaggaat tcattgtact cccagtatca ttatagtgaa agttttggct ctctcgccgg 60 
tggtttttta cctctattta aaggggtttt ccacctaaaa attctggtat cattctcact 12 0 
ttacttgtta ctttaatttc tcataatctt tggttgaaat tatcacgctt ccgcacacga 180 
tatccctaca aatttattat ttgttaaaca ttttcaaacc gcataaaatt ttatgaagtc 240 
ccgtctatct ttaatgtagt ctaacatttt catattgaaa tatataattt acttaatttt 3 00 
agcgttggta gaaagcataa tgatttattc ttattcttct tcatataaat gtttaatata 3 60 
caatataaac aaattcttta ccttaagaag gatttcccat tttatatttt aaaaatatat 420 
ttatcaaata tttttcaacc acgtaaatct cataataata agttgtttca aaagtaataa 480 
aatttaactc cataattttt ttattcgact gatcttaaag caacacccag tgacacaact 540 
agccattttt ttctttgaat aaaaaaatcc aattatcatt gtattttttt tatacaatga 600 
aaatttcacc aaacaatcat ttgtggtatt tctgaagcaa gtcatgttat gcaaaattct 660 
ataattccca tttgacacta cggaagtaac tgaagatctg cttttacatg cgagacacat 72 0 
cttctaaagt aattttaata atagttacta tattcaagat ttcatatatc aaatactcaa 780 
tattacttct aaaaaattaa ttagatataa ttaaaatatt acttttttaa ttttaagttt 840 
aattgttgaa tttgtgacta ttgatttatt attctactat gtttaaattg ttttatagat 900 
agtttaaagt aaatataagt aatgtagtag agtgttagag tgttacccta aaccataaac 960 
tataagattt atggtggact aattttcata tatttcttat tgcttttacc ttttcttggt 102 0 
atgtaagtcc gtaactggaa ttactgtggg ttgccatggc actctgtggt cttttggttc 10 80 
atgcatggat gcttgcgcaa gaaaaagaca aagaacaaag aaaaaagaca aaacagagag 114 0 
acaaaacgca atcacacaac caactcaaat tagtcactgg ctgatcaaga tcgccgcgtc 1200 
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catgtatgtc 
cccatctcaa 
tatatattca 
atgcgtgtca 
atacctataa 
actactctac 



taaatgccat 
cccacacaca 
ttctcttccg 
tcccatgccc 
atacctctaa 
tactataata 



gcaaagcaac 
aacacattgc 
ccacctcaat 
aaatctccat 
tatcactcac 
ccccaaccca 



acgtgcttaa 
ctttttcttc 
ttcttcactt 
gcatgttcca 
ttctttcatc 
actcatattc 



catgcacttt 
atcatcacca 
caacacacgt 
accaccttct 
atccatccat 
aatactactc 



aaatggctca 
caaccacctg 
caacctgcat 
ctcttatata 
ccagagtact 
tact atg 
Met 
1 



gcg gat aca get aga gga acc cat cac gat ate ate ggc aga gac cag 

Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp Gin 

5 10 15 

tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec gga cga 

Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg 

20 25 30 

gga tct gac tac tec aag tct agg cag' att get aaa get gca act get 

Gly Ser Asp Tyr Ser Lys Ser Arg Gin lie Ala Lys Ala Ala Thr Ala 

35 ~ 40 45 

gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt acc ctt gtt 

Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val 

50 ' 55 60 65 

gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt ate ttc 

Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe 

70 75 80 

age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc ate acc 

Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr 

85 90 95 

ggt ttt ctt tec tct gga ggg ttt ggc att gee get ata acc gtt ttc 
Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala lie Thr Val Phe 

100 105 HO 

tct tgg att tac aa gtaagcacac atttatcatc ttacttcata attttgtgca 
Ser Trp He Tyr Lys 
115 



1260 
1320 
1380 
1440 
1500 
1557 



1605 



1653 



1701 



1749 



1797 



1845 



1893 



1947 



atatgtgcat gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa 2007 
atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg 2 0G7 
acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga 2127 
tacctattga ttgtgaatag g tac gca acg gga gag cac cca cag gga tea 2178 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser 
120 125 



gac aag ttg gac agt gca agg atg aag ttg gga age aaa get cag gat 

Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp 
130 135 140 

ctg aaa gac aga get cag tac tac gga cag caa cat act ggt ggg gaa 

Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu 

145 ^ 150 155 160 

cat gac cgt gac cgt act cgt ggt ggc cag cac act acc atg aac acc 

His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Met Asn Thr 

165 170 175 



2226 



2274 



2322 



act cct tct gcg 
Thr Pro Ser Ala 

180 



cat gag acg ata cac gaa gtg ate gtt att ggc tec 
His Glu Thr He His Glu Val He Val lie Gly Ser 

185 190 



ggt cca gca ggc tac act get gee ctg tac gee get cgt gca cag eta 
Gly Pro Ala Gly Tyr Thr Ala Ala Leu Tyr Ala Ala Arg Ala Gin Leu 
195 200 205 



2370 



2418 
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aca ccg ctg gta ttt gag ggt acc tea ttc ggc ggc gcg ctg atg acc 
Thr Pro Leu Val Phe Glu Gly Thr Ser Phe Gly Gly Ala Leu Met Thr 
210 215 220 



2466 



acc acc gag gtg gaa aac tac cca ggt ttt cgc aac ggc ata acc ggc 2514 
Thr Thr Glu Val Glu Asn Tyr Pro Gly Phe Arg Asia Gly lie Thr Gly 
225 230 235 240 

ccg gag ttg atg gac gat atg cgt gaa cag gca ctg cga ttc ggc gcg 2562 
Pro Glu Leu Met Asp Asp Met Arg Glu Gin Ala Leu Arg Phe Gly Ala 

245 250 255 

gaa ctg egg acc gaa gac gtc gag teg gta tea ttg cgt ggc ccg ate, 2610 
Glu Leu Arg Thr Glu Asp Val Glu Ser Val Ser Leu Arg Gly Pro He 

260 ~ 265 270 

aaa teg gtc gtc acc get gaa gga cag act tat cag gec cga gec gtc 2658 
Lys Ser Val Val Thr Ala Glu Gly Gin Thr Tyr Gin Ala Arg Ala Val 
275 280 285 



ate etc gec atg ggt acc tec gtg cgt tat eta cag ate ccc ggc gag 
He Leu Ala Met Gly Thr Ser Val Arg Tyr Leu Gin He Pro Gly Glu 
290 295 300 



tec ttt ttc cgc ggc caa gac att gee gtc att ggc ggt gga gac tea 
Ser Phe Phe Arg Gly Gin Asp He Ala Val He Gly Gly Gly Asp Ser 

325 330 335 

gcg atg gag gaa gee etc ttt ttg acc egg ttc gec cgc age gtc acg 
Ala Met Glu Glu Ala Leu Phe Leu Thr Arg Phe Ala Arg Ser Val Thr 

340 345 350 

etc gtg cac cgc cgc gac gaa ttc cga get tct aag ate atg etc ggt 
Leu Val His Arg Arg Asp Glu Phe Arg Ala Ser Lys He Met Leu Gly 
355 " * 360 365 

cgc gee cgt aac aat gac aag ate aaa ttc ate acc aac cac acc gtg 
Arg Ala Arg Asn Asn Asp Lys He Lys Phe He Thr Asn His Thr Val 
370 ^ 375 380 



gca att ggc cat gaa cca cgt tec age ctg gtg age gat gtc gtc gac 
Ala He Gly His Glu Pro Arg Ser Ser Leu Val Ser Asp Val Val Asp 

420 425 430 



2706 



caa gaa ttg eta gga cgt ggc gtg agt gca tgc gcg acc tgc gac ggg 27 54 
Gin Glu Leu Leu Gly Arg Gly Val Ser Ala Cys Ala Thr Cys Asp Gly 
305 310 315 320 



2802 



2850 



2898 



2946 



gtc gcg gtg aac ggg tat aca aca gtg acc gga ttg egg ttg cgt aac 2994 

Val Ala Val Asn Gly Tyr Thr Thr Val Thr Gly Leu Arg Leu Arg Asn 
385 390 395 400 

acc aca acg gga gag gaa acc acg eta gta gtg acc ggg gtt ttt gtt 3 042 

Thr Thr Thr Gly Glu Glu Thr Thr Leu Val Val Thr Gly Val Phe Val 

405 410 415 



3090 



ata gac ccg gat ggc tac gtc ctg gtg aaa gga cgt acg acg agt aca 3138 

He Asp Pro Asp Gly Tyr Val Leu Val Lys Gly Arg Thr Thr Ser Thr 
435 440 445 

teg atg gac ggc gtt ttt gcg gec ggc gac ctg gta gat cgc acc tac 3186 

Ser Met Asp Gly Val Phe Ala Ala Gly Asp Leu Val Asp Arg Thr Tyr 
450 455 460 

egg cag gcg ate act gec gca ggt agt ggc tgt gec gee gee ate gac 3234 

Arg Gin Ala He Thr Ala Ala Gly Ser Gly Cys Ala Ala Ala He Asp 
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465 470 475 480 

gcc gaa cgt tgg ttg gcg gag cat gcc ggg tea aaa get aac gaa aca 3282 
Ala Glu Arg Trp Leu Ala Glu His Ala Gly Ser Lys Ala Asn Glu Thr 

485 490 495 

aca gag gaa act gga gac gtt gac agt acc gac aca acc gat tgg age 3 330 
Thr Glu Glu Thr Gly Asp Val Asp Ser Thr Asp Thr Thr Asp Trp Ser 

500 505 510 

act gcg atg act gac gcc aag aac gcc ggg gtc aca ata gaa gtg acc 3378 
Thr Ala Met Thr Asp Ala Lys Asn Ala Gly Val Thr lie Glu Val Thr 
515 520 525 

gat get tec ttt ttc gca gac gtc tta tec agt aat aag cct gtg tta 3426 
Asp Ala Ser Phe Phe Ala Asp Val Leu Ser Ser Asn Lys Pro Val Leu 
530 535 540 

gtt gat ttt tgg gca aca tgg tgt gga ccc tgc aag atg gta gcg ccg 3474 
Val Asp Phe Trp Ala Thr Trp Cys Gly Pro Cys Lys Met Val Ala Pro 
545 550 " 555 560 

gta etc gaa gag ate gcg tec gaa caa cga aac cag etc act gtc gcc 3522 
Val Leu Glu Glu lie Ala Ser Glu Gin Arg Asn Gin Leu Thr Val Ala 

565 570 575 

aag tta gat gta gac acc aac. ccg gaa atg gca cgc gag ttc cag gtc 3 570 
Lys Leu Asp Val Asp Thr Asn Pro Glu Met Ala Arg Glu Phe Gin Val , 

580 • 585 590 



gtg teg ata ccc aca atg att ctg ttc cag ggt ggc caa cca gta aaa 
Val Ser lie Pro Thr Met lie Leu Phe Gin Gly Gly Gin Pro Val Lys 
595 600 605 



3618 



cgc ate gtt ggc get aag ggc aaa gca gcg tta eta cgt gac ctt tec 3 666 
Arg lie Val Gly Ala Lys Gly Lys Ala Ala Leu Leu Arg Asp Leu Ser 
610 ~ 615 620 

gac gtg gta cct aac etc aat taa gctttaaata agtatgaact aaaatgcatg 3 720 
Asp Val Val Pro Asn Leu Asn * 
625 630 

taggtgtaag agctcatgga gagcatggaa tattgtatcc gaccatgtaa cagtataata 3780 
actgagctcc atctcacttc ttctatgaat aaacaaagga tgttatgata tattaacact 3840 
etatctatge accttattgt tctatgataa atttcctctt attattataa atcatctgaa 3 900 
tegtgaegge ttatggaatg cttcaaatag tacaaaaaca aatgtgtact ataagacttt 3 960 
ctaaacaatt ctaactttag cattgtgaac gagacataag tgttaagaag acataacaat 4020 
tataatggaa gaagtttgtc tccatttata tattatatat tacccactta tgtattatat 4080 
taggatgtta aggagacata acaattataa agagagaagt ttgtatccat ttatatatta 4140 
tatactaccc atttatatat tatacttatc cacttattta atgtctttat aaggtttgat 42 00 
ccatgatatt tctaatattt tagttgatat gtatatgaaa gggtactatt tgaactctct 4260 
tactctgtat aaaggttgga tcatccttaa agtgggtcta tttaatttta ttgettctta 4320 
cagataaaaa aaaaattatg agttggtttg ataaaatatt gaaggattta aaataataat 4380 
aaataataaa taacatataa tatatgtata taaatttatt ataatataac atttatctat 4440 
aaaaaagtaa atattgtcat aaatctatac aatcgtttag ccttgctgga cgactctcaa 4500 
ttatttaaac gagagtaaac atatttgact ttttggttat ttaacaaatt attatttaac 4560 
actatatgaa attttttttt tttateggea aggaaataaa attaaattag gagggacaat 4620 
ggtgtgtccc aatccttata .caaccaactt ccacaggaag gtcaggtegg ggacaacaaa 4680 
aaaacaggca agggaaattt tttaatttgg gttgtcttgt ttgetgeata atttatgeag 4740 
taaaacacta cacataaccc ttttagcagt agagcaatgg ttgaccgtgt gcttagcttc 4800 
ttttatttta tttttttatc agcaaagaat aaataaaata aaatgagaca cttcagggat 4860 
gtttcaaccc ttatacaaaa ccccaaaaac aagtttccta gcaecctacc aactaaggta 4920 



cc 

<210> 31 
<211> 118 
<212> PRT 



4922 
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<213> Artificial Sequence 
<220> 

<223> Chimeric 
<400> 31 



Met 


Ala 


Asp 


Thr 


Ala 


Arg Gly Thr 


His 


His 


Asp 


He 


He 


Gly Arg 


Asp 


1 






5 




10 










15 


Gly 


Gin 


Tyr 


Pro 


Met 


Met 


Gly Arg Asp 


Arg 


Asp 


Gin 


Tyr 


Gin 


Met 


Ser 






20 




25 










30 


Ala 


Thr 


Arg 


Gly 


Ser 


Asp 


Tyr 


Ser Lys Ser 


Arg 


Gin 


He 


Ala 


Lys 


Ala 


35 




40 










45 




Thr 




Ala 


Val 


Thr 


Ala 


Gly 


Gly Ser Leu 


Leu 


Val 


Leu 


Ser 


Ser 


Leu 


Leu 




50 






55 








60 






Val 


He 


Val 


Gly 


Thr 


Val 


He 


Ala Leu Thr 


Val 


Ala 


Thr 


Pro 


Leu 


Leu 


65 








70 






75 










80 


Phe 


Ser 


Pro 


He 


Leu 


Val Pro Ala 


Leu 


He 


Thr 


Val 


Ala 


Leu 


Leu 


He 






85 






90 










95 


Val 


Thr 


Gly 


Phe 


Leu 


Ser 


Ser Gly Gly 


Phe 


Gly 


He 


Ala 


Ala 


He 


Thr 






100 






105 










110 






Phe 


Ser 


Trp 
115 


He 


Tyr 


Lys 



















<210> 32 
<211> 513 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Chimeric 



<400> 32 


















Tyr 


Ala 


Thr 


Gly Glu 


His 


Pro 


Gin 


Gly Ser Asp 


1 








5 










10 




Arg 


Met 


Lys 


Leu 


Gly 


Ser 


Lys 


Ala 


Gin Asp 


Leu 






20 










25 




His 


Tyr 


Tyr 


Gly 


Gin 


Gin 


His 


Thr Gly 


Gly 


Glu 


35 










40 








Arg 


Gly 


Gly 


Gin 


His 


Thr 


Thr 


Met 


Asn 


Thr 


Thr 


50 










55 










Thr 


He 


His 


Glu 


Val 


He 


Val 


He 


Gly 


Ser 


Gly 


65 










70 










75 


Ala 


Ala 


Leu 


Tyr 


Ala 


Ala 


Arg 


Ala 


Gin 


Leu 


Thr 








85 










90 




Gly 


Thr 


Ser 


Phe 


Gly 


Gly Ala 


Leu 


Met 


Thr 


Thr 






100 










105 






Tyr 


Pro 


Gly 


Phe 


Arg 


Asn Gly 


He 


Thr Gly 


Pro 




115 










120 






Glu 


Met 


Arg 
130 


Glu 


Gin 


Ala 


Leu 


Arg 
135 


Phe 


Gly 


Ala 


Val 


Glu 


Ser 


Val 


Ser 


Leu 


Arg 


Gly 


Pro 


He 


Lys 


145 










150 










155 


Glu 


Gly 


Gin 


Thr 


Tyr 


Gin 


Ala 


Arg 


Ala 


Val 


He 








165 










170 




Ser 


Val 


Arg 


Tyr 


Leu 


Gin 


He 


Pro 


Gly 


Glu 


Gin 






180 










185 






Gly 


Val 


Ser 


Ala 


Cys 


Ala 


Thr 


Cys 


Asp 


Gly 


Ser 




195 










200 








Asp 


lie 


Ala 


Val 


He 


Gly Gly Gly Asp 


Ser 


Ala 


210 










215 










Phe 


Leu 


Thr 


Arg 


Phe 


Ala 


Arg 


Ser 


Val 


Thr 


Leu 


225 








230 










235 


Glu 


Phe 


Arg 


Ala 


Ser 


Lys 


He 


Met 


Leu Gly Arg 








245 










250 




Lys 


He 


Lys 


Phe 


He 


Thr 


Asn 


His 


Thr 


Val 


Val 




260 










265 







15 

Lys ABp Arg Ala Gin 
30 

Asp Arg Asp Arg Thr 
45 

Pro Ser Ala His Glu 
60 

Pro Ala Gly Tyr Thr 

80 

Pro Leu Val Phe Glu 

95 

Thr Glu Val Glu Asn 
110 

Glu Leu Met Asp Asp 
125 

Leu Arg Thr Glu Asp 
14 0 

Ser Val Val Thr Ala 

160 

Leu Ala Met Gly Thr 

175 

Glu Leu Leu Gly Arg 
190 

Phe Phe Arg Gly Gin 
205 

Met Glu Glu Ala Leu 
220 

Val His Arg Arg Asp 

240 

Ala Arg Asn Asn Asp 

255 

Ala Val Asn Gly Tyr 
270 
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X X A-L. 


TVi ** 


V CX J- 




Glv 


Leu 


Arci 


Leu 


Arg 


Asn 


Thr 


Thr 


Thr 


Gly 


Glu 


Glu 






275 








280 










285 








J. LXJ— 


Thr 


TiF*n 
uw *-x 


Val 

V «-x~ 


Val 


Thr 


Glv 


Val 


Phe 


Val 


Ala 


lie 


Gly 


His 


Glu 


Pro 




290 










295 










300 






Gly 




Aver 


v_J X. 


J ^ — • -4-. 


Leu 


Val 


Ser 


Asp 


Val 


Val 


Asp 


lie 


Asp 


Pro 


Asp 


Tyr 


305 










310 










315 










320 


v CL _1_ 


T ,<=m i 

XJC u. 


V CX JL. 


T,vg 


Glv 


Arcr 


Thr 


Thr 


Ser 


Thr 


Ser 


Met 


Asp 


Gly Val 


Phe 








325 










330 










335 


Ala 


7\ 1 -a 


Al a 




A Cir"J 
no L/ 


JJw LX 


Val 


Asd 


Aro 


Thr 


Tyr 


Arcr 


Gin 


Ala 


He 


Thr 






340 










345 










350 




Ala 


Ala 




O C J- 






Ala 


Ala 


Ala 


lie Asp 


Ala 


Glu 


Arg 


Trp 


Leu 




355 








360 










365 








VJl 


His 

XX JL 9 


Ala 


Glv 


Ser 


Lvs 


Ala 


Asn 


Glu 


Thr 


Thr 


Glu 


Glu 


Thr 


Gly Asp 




370 

— J 1 w 






375 










380 








Ala 


Val 


A«5T*> 


1— * »w« -A— 


Thir 


Asp 


Thr 


Thr 


Asp 


Trp 


Ser 


Thr 


Ala 


Met 


Thr Asp 


385 








390 










395 










400 




A en 


A 1 =a 
rtX CX 


ux y 


Val 


J* J. J- JU 


lie 


Glu 

\_J --L- M- 


Val 


Thr 


Asp 


Ala 


Ser 


Phe 


Phe 


Ala 






405 










410 










415 






Val 


JJC LX 




Ser 


Asn 


Lys 


Pro 


Val 


Leu 


val 


Asp 


Phe 


Trp 


Ala 


Thr 






420 










425 










430 




Ala 


Trp 


Cys 


Gly 


Pro 


Cys 


Lys 


Met 


Val 


Ala 


Pro 


Val 


Leu 


Glu 


Glu 


He 


435 










440 










445 






Thr 


Ser 


Glu 


Gin 


Arg 


Asn 


Gin 


Leu 


Thr 


Val 


Ala 


Lys 


Leu 


Asp 


Val 


Asp 




450 








455 










460 






Thr 




Asn 


Pro 


Glu 


Met 


Ala 


Arg 


Glu 


Phe 


Gin 


Val 


Val 


Ser 


He 


Pro 


Met 


465 










470 










475 










480 


lie 


Leu 


Phe 


Gin 


Gly 


Gly 


Gin 


Pro 


Val 


Lys 


Arg 


lie 


Val 


Gly 


Ala 


Lys 










485 








490 










495 




Gly 


Lys 


Ala 


Ala 


Leu 


Leu 


Arg 


Asp 


Leu 


Ser 


Asp 


Val 


Val 


Pro 


Asn 


Leu 




500 










505 










510 







Asn 



<210> 33 
<211> 4935 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1554) . . . (1906) 



<221> CDS 

<222> (2147) . . . (3701) 
<223> Chimeric 



<400> 33 

ctgcaggaat 

tggtttttta 

ttacttgtta 

tatccctaca 

ccgtctatct 

agcgttggta 

caatataaac 

ttatcaaata 

aatttaactc 

agecattttt 

aaatttcacc 

ataattccca 

cttctaaagt 

tattacttct 

aattgttgaa 

agtttaaagt 

tataacattt 

atgtaagtcc 



tcattgtact 
cctctattta 
ctttaatttc 
aatttattat 
ttaatgtagt 
gaaagcataa 
aaattcttta 
tttttcaacc 
cataattttt 
ttctttgaat 
aaacaatcat 
tttgacacta 
aattttaata 
aaaaaattaa 
tttgtgacta 
aaatataagt 
atggtggact 
gtaactagaa 



cccagtatca 
aaggggtttt 
tcataatctt 
ttgttaaaca 
ctaacatttt 
agatttattc 
ccttaagaag 
aegtaaatet 
ttattcgact 
aaaaaaatcc 
ttgtggtatt 
eggaagtaac 
atagttacta 
ttagatataa 
ttgatttatt 
aatgt agtag 
aattttcata 
ttacagtggg 



ttatagtgaa 
ccacctaaaa 
tggttgaaat 
ttttcaaacc 
catattgaaa 
ttattcttct 
gatttcccat 
cataataata 
gatcttaaag 
aattatcatt 
tctgaagcaa 
tgaagatctg 
tattcaagat 
ttaaaatatt 
attctactat 
agtgttagag 
tatttcttat 
ttgccatggc 



agttttggct 
attctggtat 
tatcacgett 
gcataaaatt 
tatataattt 
tcatataaat 
tttatatttt 
agttgtttca 
caacacccag 
gtattttttt 
gtcatgttat 
cttttacatg 
t tcatatatc 
acttttttaa 
gtttaaattg 
tgttacccta 
tgettttace 
actctgtggt 



ctctcgccgg 60 
cattctcact 120 
ccgcacacga 18 0 
ttatgaagtc 24 0 
acttaatttt 300 
gtttaatata 360 
aaaaatatat 42 0 
aaagtaataa 48 0 
tgacacaact 540 
tatacaatga 600 
gcaaaattct 660 
cgagacacat 72 0 
aaatactcaa 780 
ttttaagttt 840 
ttttatagat 900 
aaccataaac 96 0 
ttttcttggt 1020 
cttttggttc 1080 
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atgcatgggt cttgcgcaag aaaaagacaa 
caaaacgcaa tcacacaacc aactcaaatt 
atgtatgtct aaatgccatg caaagcaaca 
ccatctcaac ccacacacaa acacattgcc 
atatattcat tctcttccgc cacctcaatt 
tgcgtgtcat cccatgccca aatctccatg 
tacctataaa tacctctaat atcactcact 
ctactctact actataatac cccaacccaa 



agaacaaaga aaaaagacaa aacagagaga 1140 

agtcactggc tgatcaagat cgccgcgtcc 12 00 

cgtgcbtaac atgcacttta aatggctcac 1260 

tttttcttca tcatcaccac aaccacctgt 1320 

tcttcacttc aacacacgtc aacctgcata 1380 

catgttccaa ccaccttctc tcttatataa 1440 

tctttcatca tccatccatc cagagtacta 150 0 

ctcatattca atactactct act atg 1556 

Met 



gcg gat aca get aga gga acc cat cac gat ate ate ggc aga gac cag 1604 

Ala Asp Thr Ala Arg Gly Thr His Hie Asp He He Gly Arg Asp Gin 

5 10 15 

tac ccg atg atg ggc cga gac cga gac cag tac cag atg tec gga cga 1652 

Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly Arg 

20 25 30 

gga tct gac tac tec aag tct agg cag att get aaa get gca act get 1700 

Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr Ala 

35 " " 40 45 

gtc aca get ggt ggt tec etc ctt gtt etc tec age ctt acc ctt gtt 1748 

Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val 

50 * 55 60 65 

gga act gtc ata get ttg act gtt gca aca cct ctg etc gtt ate ttc 1796 

Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe 

70 75 80 



age cca ate ctt gtc ccg get etc ate aca gtt gca etc etc ate acc 
Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr 

85 90 95 



1844 



ggt ttt ctt tec tct gga ggg ttt ggc att gec get ata acc gtt ttc 1892 
Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val Phe 
100 105 HO 

tct tgg att tac aa gtaagcacac atttatcatc ttacttcata attttgtgca 1946 
Ser Trp He Tyr Lys 
115 

atatgtgcat gcatgtgttg agecagtage tttggatcaa tttttttggt cgaataacaa 2 006 
atgtaacaat aagaaattgc aaattctagg gaacatttgg ttaactaaat acgaaatttg 2066 
acctagctag cttgaatgtg tctgtgtata tcatctatat aggtaaaatg cttggtatga 2126 
tacctattga ttgtgaatag g tac gca acg gga gag cac cca cag gga tea 2177 

" ~ "* Tyr Ala Thr Gly Glu His Pro Gin Gly Ser 

120 125 

gac aag ttg gac agt gca agg atg aag ttg gga age aaa get cag gat 2225 
Asp Lys Leu ABp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp 
130 135 140 

ctg aaa gac aga get cag tac tac gga cag caa cat act ggt ggg gaa 2273 
Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu 
145 150 155 160 

cat gac cgt gac cgt act cgt ggt ggc cag cac act acc atg aat ggt 2321 
His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Met Asn Gly 

165 170 175 



etc gaa act cac aac aca agg etc tgt ate gta gga agt ggc cca gcg 
Leu Glu Thr His Asn Thr Arg Leu Cys He Val Gly Ser Gly Pro Ala 

180 185 190 



2369 



gca cac acg gcg gcg att tac gca get agg get gaa ctt aaa cct 



ctt 2417 
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Ala His Tbr Ala Ala lie Tyr Ala Ala Arg Ala Glu Leu Lys Pro Leu 
195 200 205 

etc ttc gaa gga tgg atg get aac gac ate get ccc ggt ggt caa eta 2465 
Leu Phe Glu Gly Trp Met Ala Asn Asp lie Ala Pro Gly Gly Gin Leu 
210 ^ 215 220 

aca ace acc acc gac gtc gag aat ttc ccc gga ttt cca gaa ggt att 2513 
Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro Glu Gly He 
225 230 235 240 

etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag cga ttc 2561 
Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu Arg Phe 

245 250 255 



ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat ttc tct teg 
Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe Ser Ser 

260 265 ' 270 



get gtt tgc gac gga get get ccg ata ttc cgt aac aaa cct ctt gcg 

Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys Pro Leu Ala 

325 330 335 

gtg ate ggt gga ggc gat tea gca atg gaa gaa gca aac ttt ctt aca 

Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe Leu Thr 

340 ~ 345 350 

aaa tat gga tct aaa gtg tat ata ate cat agg aga gat get ttt aga 

Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp Ala Phe Arg 
355 " 360 365 



tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat gag cca 
Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His Glu Pro 

42 0 425 43 0 



2609 



aaa ccg ttt aag eta ttc aca gat tea aaa gee att etc get gac get 2657 

Lys' Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala Asp Ala 
275 280 285 

gtg att etc get act gga get gtg get aag egg ctt age ttc gtt gga 2705 

Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser Phe Val Gly 
290 295 300 

tct ggt gaa ggt tct gga ggt ttc tgg aac cgt gga ate tec get tgt 2753 

Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly He Ser Ala Cys 

305 " ~ 310 315 320 



2801 



2849 



2897 



gcg tct aag att atg cag cag cga get ttg tct aat cct aag att gat 2945 

Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro Lys He Asp 
370 375 380 

gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga gaa aga 2993 

Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly Glu Arg 

385 390 395 400 

gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc gga gat gtt 3 041 

Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly Asp Val 

405 410 415 



3089 



get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat ggt tat 3137 

Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp Gly Tyr 
435 440 445 

gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga gtt ttc 3185 

Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro Gly Val Phe 

450 ~ 455 460 



-33- 



WO 02/50289 



PCTYUSO 1/50240 



get gcg ggt gat gtt cag gat aag aag tat agg caa gec ate act get 3233 
Ala Ala Gly Asp Val Gin Asp Lye Lys Tyr Arg Gin Ala He Thr Ala 
465 470 475 480 

gca gga act ggg tgc atg gca get ttg gat gca gag cat tac tta caa 32 Bl 
Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr Leu Gin 

485 490 495 

gag att get gga teg aag get aac gag acc acc gag gaa act gga gat 33 29 
Glu He Ala Gly Ser Lys Ala Asn Glu Thr Thr Glu Glu Thr Gly Asp 

500 505 510 

gtt gac teg acg gat act acg gat tgg teg acg get atg gaa gaa gga 3377 
Val Asp Ser Thr Asp Thr Thr Asp Trp Ser Thr Ala Met Glu Glu Gly 
515 520 525 

caa gtg ate gee tgc cac acc gtt gag aca tgg aac gag cag ctt cag 3425 
Gin Val He Ala Cys His Thr Val Glu Thr Trp Asn Glu Gin Leu Gin 
530 * 535 540 

aag get aat gaa tec aaa act ctt gtg gtg gtt gat ttc acg get tct 3473 
Lys Ala Asn Glu Ser Lys Thr Leu Val Val Val Asp Phe Thr Ala Ser 
545 550 555 560 

tgg tgt gga cca tgt cgt ttc ate get cca ttc ttt get gat ttg get 3521 
Trp Cys Gly Pro Cys Arg Phe He Ala Pro Phe Phe Ala Asp Leu Ala 

565 570 575 

aag aaa ctt cct aac gtg ctt ttc etc aag. gtt gat act gat gaa ttg 3569 
Lys Lys Leu Pro Asn Val Leu Phe Leu Lys Val Asp Thr Asp Glu Leu 

580 585 590 

aag teg gtg gca agt gat tgg gcg ata cag gcg atg cca acc ttc atg 3617 
Lys Ser Val Ala Ser Asp Trp Ala He Gin Ala Met Pro Thr Phe Met 
595 600 605 

ttt ttg aag gaa ggg aag att ttg gac aaa gtt gtt gga gee aag aaa 3 665 
Phe Leu Lys Glu Gly Lys He Leu Asp Lys Val Val Gly Ala Lys Lys 
610 615 620 

gat gag ctt cag tct acc att gee aaa cac ttg get taagcttaaa 3711 
Asp Glu Leu Gin Ser Thr He Ala Lys His Leu Ala 
625 630 635 

taagtatgaa etaaaatgea tgtaggtgta agagctcatg gagagcatgg aatattgtat 3771 
ccgaccatgt aacagtataa taactgagct ccatctcact tcttctatga ataaacaaag 3831 
gatgttatga tatattaaca ctctatctat gcaccttatt gttctatgat aaatttcctc 3891 
ttattattat aaatcatctg aatcgtgacg gcttatggaa tgcttcaaat agtacaaaaa 3 951 
caaatgtgta ctataagact ttctaaacaa ttctaacttt agcattgtga acgagacata 4 011 
agtgttaaga agacataaca attataatgg aagaagtttg tctccattta tatattatat 4071 
attacccact tatgtattat attaggatgt taaggagaca taacaattat aaagagagaa 4131 
gtttgtatcc atttatatat tatatactac ccatttatat attatactta tccacttatt 4191 
taatgtcttt ataaggtttg atccatgata tttctaatat tttagttgat atgtatatga 4251 
aagggtacta tttgaactct cttactctgt ataaaggttg gatcatcctt aaagtgggtc 4311 
tatttaattt tattgettet tacagataaa aaaaaaatta tgagttggtt tgataaaata 4371 
ttgaaggatt taaaataata ataaataata aataacatat aatatatgta tataaattta 4431 
ttataatata acatttatct ataaaaaagt aaatattgtc ataaatctat acaatcgttt 4491 
agecttgetg gacgactctc aattatttaa acgagagtaa acatatttga ctttttggtt 4 551 
atttaacaaa ttattattta acactatatg aaattttttt tttttategg caaggaaata 4611 
aaattaaatt aggagggaca atggtgtgtc ccaatcctta tacaaccaac ttccacagga 4671 
aggtcaggtc ggggacaaca aaaaaacagg caagggaaat tttttaattt gggttgtctt 4731 
gtttgetgea taatttatgc agtaaaacac tacacataac ccttttagca gtagagcaat 4791 
ggttgaccgt gtgettaget tcttttattt tattttttta tcagcaaaga ataaataaaa 4 851 
taaaatgaga cacttcaggg atgtttcaac ccttatacaa aaccccaaaa acaagtttcc 4 911 
tagcacccta ccaactaagg tacc 4935 

<210> 34 
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<211> 


118 




<212> 


PRT 




<213> 


Artif icia 


<220> 








<221> 


SITE 




<222> 


(1) . . • 


(11 


<223> 


oleosin 


<223> 


Chimeric 


<400> 


34 




Met Ala 


Asp 


Thr 


1 








Gin Tyr 


Pro 


Met 








20 


Arg Gly 


Ser 


Asp 






35 




Ala Val 


Thr 


Ala 


50 






Val Gly 


Thr 


Val 


65 








Phe Ser 


Pro 


He 


Thr Gly 


Phe 


Leu 








100 


Phe Ser 


Trp 


He 






115 





Sequence 



) 



Ala Arg Gly Thr 
5 

Met Gly Arg Asp 

Tyr Ser Lys Ser 

40 

Gly Gly Ser Leu 
55 

He Ala Leu Thr 
70 

Leu Val Pro Ala 
85 

Ser Ser Gly Gly 
Tyr Lys 



His His Asp He 
10 

Arg Asp Gin Tyr 
25 

Arg Gin lie Ala 

Leu Val Leu Ser 

60 

Val Ala Thr Pro 
75 

Leu He Thr Val 
90 

Phe Gly He Ala 
105 



He Gly Arg Asp 
15 

Gin Met Ser Gly 
30 

Lys Ala Ala Thr 
45 

Ser Leu Thr Leu 

Leu Leu Val He 

80 

Ala Leu Leu He 
95 

Ala He Thr Val 
110 



<210> 35 
<211> 518 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> SITE 
<222> (1) . . . (55) 
<223> oleosin 

<221> SITE 

<222> (56) . . - (383) 

<223> thioredoxin reductase 

<221> SITE 

<222> (384) . . . (406) 

<223> linker 

<221> SITE 

<222> (407) . . . (518) 

<223> thioredoxin 

<223> Chimeric 

<400> 35 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lys Leu Asp Ser Ala 

15 10 15 

Arg Met Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp Arg Ala Gin 

20 25 30 

Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu His Asp Arg Asp Arg Thr 

"35 40 4 5 

Arg Gly Gly Gin His Thr Thr Met Asn Gly Leu Glu Thr His Asn Thr 

50 ^ 55 60 

Arg Leu Cys He Val Gly Ser Gly Pro Ala Ala His Thr Ala Ala lie 
65 70 75 80 

Tyr Ala Ala Arg Ala Glu Leu Lys Pro Leu Leu Phe Glu Gly Trp Met 

85 90 95 
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Ala Asn Asp He 

100 

Glu Asn Phe Pro 
115 

Asp Lys Phe Arg 
130 

Glu Thr Val Thr 
145 

Thr Asp Ser Lys 

Ala Val Ala Lys 

180 

Gly Phe Trp Asn 
195 

Ala Pro He Phe 
210 

Ser Ala Met Glu 
225 

Tyr lie lie His 

Gin Arg Ala Leu 

260 

Val Val Glu Ala 
275 

Lys Val Lys Asn 
2 90 

Gly Leu Phe Phe 
305 

Gly Gly Val Glu 

Thr Thr Gin Thr 

340 

Asp Lys Lys Tyr 
355 

Ala Ala Leu Asp 
370 

Ala Asn Glu Thr 
385 

Thr Asp Trp Ser 

Thr Val Glu Thr 

420 

Thr Leu Val Val 
435 

Phe He Ala Pro 
450 

Leu Phe Leu Lys 
465 

Trp Ala He Gin 

He Leu Asp Lys 

500 

He Ala Lys His 
515 



Ala Pro Gly Gly 

Gly Phe Pro Glu 

120 

Lys Gin Ser Glu 
13 5 

Lys Val Asp Phe 
150 

Ala He Leu Ala 
165 

Arg Leu Ser Phe 

Arg Gly He Ser 

200 

Arg Asn Lys Pro 
215 

Glu Ala Asn Phe 
230 

Arg Arg Asp Ala 
245 

Ser Asn Pro Lys 

Tyr Gly Asp Gly 

2B0 

Val Val Thr Gly 
295 

Ala He Gly His 
310 

Leu Asp Ser Asp 
325 

Ser Val Pro Gly 

Arg Gin Ala He 

360 

Ala Glu His Tyr 
375 

Thr Glu Glu Thr 

390 . 
Thr Ala Met Glu 
405 

Trp Asn Glu Gin 

Val Asp Phe Thr 

440 

Phe Phe Ala Asp 
455 

Val Asp Thr Asp 
470 

Ala Met Pro Thr 
485 

Val Val Gly Ala 
Leu Ala 



Gin Leu Thr Thr 
105 

Gly He Leu Gly 

Arg Phe Gly Thr 

140 

Ser Ser Lys Pro 
155 

Asp Ala Val He 
17 0 

Val Gly Ser Gly 
185 

Ala Cys Ala Val 

Leu Ala Val He 

220 

Leu Thr Lys Tyr 
235 

Phe Arg Ala Ser 
250 

lie Asp Val He 
265 

Glu Arg Asp Val 

Asp Val Ser Asp 

300 

Glu Pro Ala Thr 
315 

Gly Tyr Val Val 
330 

Val Phe Ala Ala 
345 

Thr Ala Ala Gly 

Leu Gin Glu He 

380 

Gly Asp Val Asp 
3 95 

Glu Gly Gin Val 
410 

Leu Gin Lys Ala 
425 

Ala Ser Trp Cys 

Leu Ala Lys Lys 

460 

Glu Leu Lys Ser 
475 

Phe Met Phe Leu 
490 

Lys Lys Asp Glu 
505 



Thr Thr Asp Val 
110 

Val Glu Leu Thr 
12 5 

Thr He Phe Thr 

Phe Lys Leu Phe 

160 

Leu Ala Thr Gly 
175 

Glu Gly Ser Gly 
190 

Cys Asp Gly Ala 
205 

Gly Gly Gly Asp 

Gly Ser Lys Val 

240 

Lys lie Met Gin 
255 

Trp Asn Ser Ser 
270 

Leu Gly Gly Leu 
285 

Leu Lys Val Ser 

Lys Phe Leu Asp 

320 

Thr Lys Pro Gly 
335 

Gly Asp Val Gin 
350 

Thr Gly Cys Met 
365 

Ala Gly Ser Lys 

Ser Thr Asp Thr 

400 

He Ala Cys His 
415 

Asn Glu Ser Lys 
430 

Gly Pro Cys Arg 
445 

Leu Pro Asn Val 

Val Ala Ser Asp 

480 

Lys Glu Gly Lys 
495 

Leu Gin Ser Thr 
510 



<210> 36 
<211> 458 
<212> PRT 

<213> Mycobacterium leprae 
<400> 36 

Met Asn Thr Thr Pro Ser Ala His 

1.5 
He Gly Ser Gly Pro Ala Gly Tyr 

20 

Ala Gin Leu Thr Pro Leu Val Phe 



Glu Thr He His Glu Val He Val 

10 15 
Thr Ala Ala Leu Tyr Ala Ala Arg 
25 30 
Glu Gly Thr Ser Phe Gly Gly Ala 
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35 

Leu Met Thr Thr 
50 

He Thr Gly Pro 
65 

Phe Gly Ala Glu 

Gly Pro He Lys 

100 

Arg Ala Val He 
115 

Pro Gly Glu Gin 
130 

Cys Asp Gly Ser 
145 

Gly Asp Ser Ala 

Ser Val Thr Leu 

180 

Met Leu Gly Arg 
195 

His Thr Val Val 
210 

Leu Arg Asn Thr 
225 

Val Phe Val Ala 

Val Val Asp He 

260 

Thr Ser Thr Ser 
275 

Arg Thr Tyr Arg 
290 

Ala He Asp Ala 
305 

Asn Glu Thr Thr 

Asp Trp Ser Thr 

340 

Glu Val Thr Asp 
355 

Pro Val Leu Val 
370 

Val Ala Pro Val 
3 85 

Thr Val Ala Lys 

Phe Gin Val Val 

420 

Pro Val Lys Arg 
435 

Asp Leu Ser Asp 
450 



40 

Thr Glu Val Glu 
55 

Glu Leu Met Asp 
70 

Leu Arg Thr Glu 
85 

Ser Val Val Thr 

Leu Ala Met Gly 

120 

Glu Leu Leu Gly 
135 

Phe Phe Arg Gly 
150 

Met Glu Glu Ala 
165 

Val His Arg Arg 

Ala Arg Asn Asn 

200 

Ala Val Asn Gly 
215 

Thr Thr Gly Glu 
230 

He Gly His Glu 
2 45 

Asp Pro Asp Gly 

Met Asp Gly Val 

2B0 

Gin Ala He Thr 
295 

Glu Arg Trp Leu 
310 

Glu Glu Thr Gly 
325 

Ala Met Thr Asp 

Ala Ser Phe Phe 

360 

Asp Phe Trp Ala 
3 75 

Leu Glu Glu He 
390 

Leu Asp Val Asp 
405 

Ser He Pro Thr 

He Val Gly Ala 

440 

Val Val Pro Asn 
455 



Asn 


Tyr 


Pro 


Gly 








60 


Asp 


Met 


Arg 


Glu 






75 




Asp 


Val 


Glu 


Ser 




90 






Ala 


Glu Gly 


Gin 


105 








Thr 


Ser 


Val 


Arg 


Arg Gly Val 


Ser 








14 0 


Gin 


Asp 


He 


Ala 






155 




Leu 


Phe 


Leu 


Thr 




170 






Asp 


Glu 


Phe 


Arg 


185 








Asp 


Lys 


He 


Lys 


Tyr 


Thr 


Thr 


Val 






220 


Glu 


Thr 


Thr 


Leu 






235 




Pro 


Arg 


Ser 


Ser 




250 






Tyr 


Val 


Leu 


Val 


265 








Phe 


Ala 


Ala 


Gly 


Ala Ala Gly 


Ser 








300 


Ala 


Glu 


His 


Ala 






315 




Asp 


Val 


Asp 


Ser 




330 






Ala 


Lys 


Asn 


Ala 


345 








Ala 


Asp 


Val 


Leu 


Thr 


Trp 


Cys 


Gly 








380 


Ala 


Ser 


Glu 


Gin 






395 




Thr 


Asn 


Pro 


Glu 




410 






Met 


He 


Leu 


Phe 


425 








Lys 


Gly Lys 


Ala 


Leu 


Asn 







45 

Phe Arg Asn Gly 

Gin Ala Leu Arg 

80 

Val Ser Leu Arg 
95 

Thr Tyr Gin Ala 
110 

Tyr Leu Gin lie 
125 

Ala Cys Ala Thr 

Val He Gly Gly 

160 

Arg Phe Ala Arg 
175 

Ala Ser Lys He 
190 

Phe He Thr Asn 
205 

Thr Gly Leu Arg 

Val Val Thr Gly 

240 

Leu Val Ser Asp 
255 

Lys Gly Arg Thr 
270 

Asp Leu Val Asp 
285 

Gly Cys Ala Ala 

Gly Ser Lys Ala 

320 

Thr Asp Thr Thr 
335 

Gly Val Thr He 
350 

Ser Ser Asn Lys 
365 

Pro Cys Lys Met 

Arg Asn Gin Leu 

400 

Met Ala Arg Glu 
415 

Gin Gly Gly Gin 
430 

Ala Leu Leu Arg 
445 



<210> 37 
<211> 471 
<212> PRT 

<213> Arabidopsis thai i ana 
<220> 

<223> Chimeric 
<400> 37 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys He Val Gly Ser 

1 5 10 15 

Gly Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu 
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20 25 30 

Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp lie Ala Pro Gly 

35 40 45 

Gly Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro 

50 55 60 

Glu Gly lie Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser 
65 70 75 80 

Glu Arq Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp 

B5 90 95 

Phe Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu 

100 105 HO 

Ala Asp Ala Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser 

115 120 125 

Phe Val Gly Ser Gly Glu Gly Ser Gly Gly Phe Trp Asn Arg Gly lie 

130 * 135 140 

Ser Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys 
145 150 155 160 

Pro Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn 

165 170 175 

Phe Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp 

180 185 190 

Ala Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro 

195 ~ 200 205 

Lys He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp 

210 215 220 

Glv Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr 
225 230 ~ ~ 235 240 

Gly Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly 

245 250 255 

His Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser 

260 265 270 

Asn Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro 

275 280 285 

Gly Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala 

290 295 300 

He Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His 
305 310 315 320 

Tyr Leu Gin Glu He Ala Gly Ser Lys Ala Asn Glu Thr Thr Glu Glu 

325 330 335 

Thr Gly Asp Val Asp Ser Thr Asp Thr Thr Asp Trp Ser Thr Ala Met 

340 345 350 

Glu Glu Gly Gin Val He Ala Cys Glu Glu Gly Gin Val He Ala Cys 

355 360 365 

His Thr Val Glu Thr Trp Asn Glu Gin Leu Gin Lys Ala Asn Glu Ser 

370 375 380 

Lys Thr Leu Val Val Val Asp Phe Thr Ala Ser Trp Cys Gly Pro Cys 
385 390 395 400 

Arg Phe He Ala Pro Phe Phe Ala Asp Leu Ala Lys Lys Leu Pro Asn 

405 410 415 

Val Leu Phe Leu Lys Val Asp Thr Asp Glu Leu LyB Ser Val Ala Ser 

420 " 425 430 

Asp Trp Ala He Gin Ala Met Pro Thr Phe Met Phe Leu Lys Glu Gly 

435 440 445 

Lys He Leu Asp Lys Val Val Gly Ala Lys Lys Asp Glu Leu Gin Ser 

450 455 460 

Thr He Ala Lys His Leu Ala 
465 470 



<210> 38 
<211> 345 
<212> DMA 

<213> Arabidopsis thai i ana 

<220> 

<221> CDS 

<222> (1) . . . (345) 
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<400> 38 

atg get teg gaa gaa gga caa gtg ate gec tgc cac acc gtt gag aca 
Met Ala Ser Glu Glu Gly Gin Val lie Ala Cys His Thr Val Glu Thr 
15 10 15 

tgg aac gag cag ctt cag aag get aat gaa tec aaa act ctt gtg gtg 
Trp Asn Glu Gin Leu Gin Lys Ala Asn Glu Ser Lys Thr Leu Val Val 

20 25 30 



ttc ttt get gat ttg get aag aaa ctt cct aac gtg ctt ttc etc aag 

Phe Phe Ala Asp Leu Ala Lys Lys Leu Pro Asn Val Leu Phe Leu Lys 

50 55 60 

gtt gat act gat gaa ttg aag teg gtg gca agt gat tgg gcg ata cag 

Val Asp Thr Asp Glu Leu Lys Ser Val Ala Ser Asp Trp Ala lie Gin 

65 " 70 75 80 

gcg atg cca acc ttc atg ttt ttg aag gaa ggg aag att ttg gac aaa 

Ala Met Pro Thr Phe Met Phe Leu Lya Glu Gly Lys lie Leu Asp Lys 

85 90 95 



ttg get taa 
Leu Ala * 



<210> 39 
<211> 114 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 39 



Met 


Ala 


Ser 


Glu 


Glu 


Gly 


Gin 


Val 


He 


Ala 


Cys His 


Thr 


Val 


Glu 


Thr 


1 








5 








10 








15 


Val 


Trp 


Asn 


Glu 


Gin 


Leu 


Gin 


Lys 


Ala 


Asn 


Glu 


Ser Lys 


Thr 


Leu 


Val 






20 










25 








30 






Val 


Asp 


Phe 


Thr 


Ala 


Ser 


Trp 


Cys 


Gly Pro 


Cys Arg 


Phe 


He 


Ala 


Pro 




35 










40 








45 • 








Phe 


Phe 


Ala 


Asp 


Leu 


Ala 


Lys 


Lys 


Leu 


Pro 


Asn Val 


Leu 


Phe 


Leu 


Lys 




50 








55 








60 








Gin 


Val 


Asp 


Thr 


Asp 


Glu 


Leu 


Lys 


Ser 


Val 


Ala 


Ser Asp 


Trp 


Ala 


He 


65 






70 










75 








80 


Ala 


Met 


Pro 


Thr 


Phe 
85 


Met 


Phe 


Leu 


Lys 


Glu 
90 


Gly Lys 


He 


Leu 


Asp 
95 


Lys 


Val 


Val 


Gly 


Ala 


Lys 


Lys 


Asp 


Glu 


Leu 


Gin 


Ser Thr 


He 


Ala 


Lys 


His 






100 






105 








110 







48 



96 



gtt gat ttc acg get tct tgg tgt gga cca tgt cgt ttc ate get cca 144 
Val Asp Phe Thr Ala Ser Trp Cys Gly Pro Cys Arg Phe He Ala Pro 
35 40 ' 45 



192 



240 



288 



gtt gtt gga gee aag aaa gat gag ctt cag tct acc att gee aaa cac 33 6 
Val Val Gly Ala Lys Lys Asp Glu Leu Gin Ser Thr He Ala Lys His 

100 105 HO 



345 



Leu Ala 



<210> 40 

<211> 999 

<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (1) . . . (999) 
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<400> 40 

atg aat ggt etc gaa act cac aac aca agg etc tgt ate gta gga agt 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys He Val Gly Ser 
15 10 15 

ggc cca gcg gca cac acg gcg gcg att tac gca get agg get gaa ctt 
Gly Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu 

20 25 30 



ggt caa etc aac caa cca ccg cgt gag aat ttc ccc gga ttt cca gaa 
Gly Gin Leu Asn Gin Pro Pro Arg Glu Asn Phe Pro Gly Phe Pro Glu 
50 55 60 

ggt att etc gga gta gag etc act gac aaa ttc cgt aaa caa teg gag 
Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser Glu 
65 70 75 80 

cga ttc ggt act acg ata ttt aca gag acg gtg acg aaa gtc gat ttc 
Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp Phe 

85 90 95 



gac get gtg att etc get ate gga get gtg get aag tgg ctt age ttc 
Asp Ala Val He Leu Ala He Gly Ala Val Ala Lys Trp Leu Ser Phe 
115 120 125 



ttt aga gcg tct aag att atg cag cag cga get ttg tct aat cct aag 
Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro Lys 
195 200 205 



gat gtt tct gat tta aaa gtt tct gga ttg ttc ttt get att ggt cat 
Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly His 

245 250 255 



48 



96 



aaa cct ctt etc ttc gaa gga tgg atg get aac gac ate get ccc ggt 144 
Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly 
35 "* 40 45 



192 



240 



288 



tct teg aaa ccg ttt aag eta ttc aca gat tea aaa gee att etc get 336 
Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu Ala 

100 105 HO 



384 



480 



gtt gga tct ggt gaa gtt etc gga ggt ttg tgg aac cgt gga ate tec 432 

Val Gly Ser Gly Glu Val Leu Gly Gly Leu Trp Asn Arg Gly He Ser 
130 135 140 

get tgt get gtt tgc gac gga get get ccg ata ttc cgc aac aaa cct 

Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys Pro 
145 * 150 155 160 

ctt gcg gtg ate ggt gga ggc gat tct gca atg gaa gaa gca aac ttt 

Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn Phe 

165 " 170 175 



528 



ctt aca aaa tat gga tct aaa gtg tat ata ate gat agg aga gat get 576 
Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He Asp Arg Arg Asp Ala 

180 185 190 



624 



att gat gtg att tgg aac teg tct gtt gtg gaa get tat gga gat gga 672 
lie Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp Gly 
210 215 220 

gaa aga gat gtg ctt gga gga ttg aaa gtg aag aat gtg gtt acc gga 720 
Glu Arg Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr Gly 
225 230 235 240 



768 



gag cca get acc aag ttt ttg gat ggt ggt gtt gag tta gat teg gat 816 
Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser Asp 
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260 265 270 

ggt tat gtt gtc acg aag cct ggt act aca cag act age gtt ccc gga 864 
Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Glri Thr Ser Val Pro Gly 
275 280 285 

gtt ttc get gcg ggt gat gtt cag gat aag aag tat agg caa gec ate 912 
Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala He 
290 295 300 

act get gca gga act ggg tgc atg gca get ttg gat gca gag cat tac 960 
Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His Tyr 
305 " 310 315 320 

tta caa gag att gga tct cag caa ggt aag agt gat tga 999 
Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp * 

325 330 

<210> 41 
<211> 332 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 41 



Met 


Asn 


Gly 


Leu 


1 








Gly 


Pro 


Ala 


Ala 






20 


Lys 


Pro 


Leu 


Leu 






35 




Gly 


Gin 


Leu 


Asn 




50 






Gly 


He 


Leu 


Gly 


65 








Arg 


Phe 


Gly 


Thr 


Ser 


Ser 


Lys 


Pro 








100 


Asp 


Ala 


Val 


He 






115 




Val 


Gly 


Ser 


Gly 




130 






Ala 


Cys 


Ala 


Val 


145 








Leu 


Ala 


Val 


He 


Leu 


Thr 


Lys 


Tyr 








180 


Phe 


Arg 


Ala 


Ser 






195 




He 


Asp 


Val 


He 




210 






Glu 


Arg 


Asp 


Val 


225 








Asp 


Val 


Ser 


Asp 


Glu 


Pro 


Ala 


Thr 








260 


Gly 


Tyr 


Val 


Val 






275 




Val 


Phe 


Ala 


Ala 




290 






Thr 


Ala 


Ala 


Gly 


305 








Leu 


Gin 


Glu 


He 



Lj J-UL 


l nr 


TT J _. 

ill S 


7\ r~~1 T~x 


5 








TT 2 — 

ill S 


inr 


ZV 1 =1 


+\X a. 


rxie 


tslU 


uiy 


l rp 








40 


Gin 


Pro 


Pro 


Arg 






55 




Val 


Glu 


Leu 


Thr 




70 






Thr 


He 


Phe 


Thr 


85 








Phe 


Lys 


Leu 


Phe 


Leu 


Ala 


He 


Gly 








120 


Glu 


Val 


Leu 


Gly 






135 




Cys 


Asp 


Gly 


Ala 




150 






Gly 


Gly 


Gly 


Asp 


165 








Gly 


Ser 


Lys 


Val 


Lys 


He 


Met 


Gin 








200 


Trp 


Asn 


Ser 


Ser 




215 




Leu 


Gly 


Gly 


Leu 




230 






Leu 


Lys 


Val 


Ser 


245 








Lys 


Phe 


Leu 


Asp 


Thr 


Lys 


Pro 


.Gly 








280 


Gly 


Asp 


Val 


Gin 






295 




Thr 


Gly 


Cys 


Met 




310 






Gly 


Ser 


Gin 


Gin 


325 









Thr Arg Leu Cys 
10 

He Tyr Ala Ala 
25 

Met Ala Asn Asp 

Glu Asn Phe Pro 

60 

Asp Lys Phe Airg 
75 

Glu Thr Val Thr 
90 

Thr Asp Ser Lys 
105 

Ala Val Ala Lys 

Gly Leu Trp Asn 

140 

Ala Pro He Phe 
155 

Ser Ala Met Glu 
170 

Tyr He He Asp 
185 

Gin Arg Ala Leu 

Val Val Glu Ala 

220 

Lys Val Lys Asn 
235 

Gly Leu Phe Phe 
250 

Gly Gly Val Glu 
265 

Thr Thr Gin Thr 

Asp Lys Lys Tyr 

300 

Ala Ala Leu Asp 
315 

Gly Lys Ser Asp 
330 



He Val Gly Ser 
15 

Arg Ala Glu Leu 
30 

He Ala Pro Gly 
45 

Gly Phe Pro Glu 

Lys Gin Ser Glu 

80 

Lys Val Asp Phe 
95 

Ala He Leu Ala 
110 

Trp Leu Ser Phe 
125 

Arg Gly He Ser 

Arg Asn Lys Pro 

160 

Glu Ala Asn Phe 
175 

Arg Arg Asp Ala 
190 

Ser Asn Pro Lys 
205 

Tyr Gly Asp Gly 

Val Val Thr Gly 

240 

Ala He Gly His 
255 

Leu Asp Ser Asp 
270 

Ser Val Pro Gly 
285 

Arg Gin Ala He 

Ala Glu His Tyr 

320 
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<210> 42 
<211> 332 
<212> DNA 
<213> E. coli 

<220> 

<221> CDS 

<222> (1) . . . (332) 

<400> 42 

atg age gat aaa att att cac ctg act gac gac agt ttt gac acg gat 

Met Ser Asp Lys lie lie His Leu Thr Asp Asp Ser Phe Asp Thr Asp 

15 10 15 

gta etc aaa gcg gac ggg get ate etc gtt gat ttc tgg gca gag tgg 
Val Leu Lys Ala Asp Gly Ala He Leu Val Asp Phe Trp Ala Glu Trp 

20 25 30 



gaa tat cag ggc aaa ttg acc gtt gee aaa ctg aac att gac cag aac 
Glu Tyr Gin Gly Lys Leu Thr Val Ala Lys Leu Asn He Asp Gin Asn 
5 0 55 6 0 

cca ggt act gcg cct aaa tat ggc ate cgc ggt att ccg act ctg ctg 
Pro Gly Thr Ala Pro Lys Tyr Gly He Arg Gly He Pro Thr Leu Leu 
65 70 ~ 75 80 

ctg ttt aaa aac ggc gaa gtg gcg gca acc aaa gta ggc gca ctg tct 
Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser 

85 90 x 95 



aaa ggt cag ttg aaa gag ttt etc gac gec aat ctg gcg taa ta 
Lys Gly Gin Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala 

100 105 



* 



<210> 43 

<211> 109 

<212> PRT 

<213> E. coli 

<400> 43 

Met Ser Asp Lys He He His Leu Thr Asp Asp Ser Phe Asp Thr Asp 

15 10 15 

Val Leu Lys Ala Asp Gly Ala He Leu Val Asp Phe Trp Ala Glu Trp 

20 25 30 

Cys Gly Pro Cys Lys Met He Ala Pro He Leu Asp Glu He Ala Asp 

35 40 45 

Glu Tyr Gin Gly Lys Leu Thr Val Ala Lys Leu Asn He Asp Gin Asn 

50 ^ 55 6Q 

Pro Gly Thr Ala Pro Lys Tyr Gly He Arg Gly He Pro Thr Leu Leu 
65 70 75 80 

Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser 

85 90 95 

Lys Gly Gin Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala 

100 ~ 105 



48 



96 



tgc ggg ccg tgt aaa atg ate get ccg att ctg gat gaa ate get gac 144 
Cys Gly Pro Cys Lys Met He Ala Pro He Leu Asp Glu He Ala Asp 
35 40 45 



192 



240 



288 



332 



<210> 44 
<211> 966 
<212> DNA 
<213> E. coli 
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<220> 

<221> CDS 

<222> (1) . . . (966) 



<400> 44 

atg ggc acg acc aaa cac agt aaa ctg ctt ate ctg ggt tea ggc ccg 

Met Gly Thr Thr Lys His Ser Lys Leu Leu lie Leu Gly Ser Gly Pro 

15 10 15 

gcg gga tac acc get get gtc tac gcg gcg cgc gee aac ctg caa cct 

Ala Gly Tyr Thr Ala Ala Val Tyr Ala Ala Arg Ala Asn Leu Gin Pro 

20 25 30 

gtg ctg att acc ggc atg gaa aaa ggc ggc caa ctg acc acc acc acg 

Val Leu lie Thr Gly Met Glu Lys Gly Gly Gin Leu Thr Thr Thr Thr 

35 40 45 



tta atg gag cgc atg cac gaa cat gee acc aag ttt gaa act gag ate 

Leu Met Glu Arg Met His Glu His Ala Thr Lys Phe Glu Thr Glu lie 

65 70 75 80 

att ttt gat cat ate aac aag gtg gat ctg caa aac cgt ccg ttc cgt 

lie Phe Asp His lie Asn Lys Val Asp Leu Gin Asn Arg Pro Phe Arg 

85 90 95 



acc gga get tct gca cgc tat etc ggc ctg ccc tct gaa gaa gee ttt 
Thr Gly Ala Ser Ala Arg Tyr Leu Gly Leu Pro Ser Glu Glu Ala Phe 
115 ~ 120 125 



gag gcg ttg tat ctg tct aac ate get teg gaa gtg cat ctg att cac 
Glu Ala Leu Tyr Leu Ser Asn He Ala Ser Glu Val His Leu He His 

165 170 175 



gat acg caa aac age gat aac ate gag tea etc gac gtt gec ggt ctg 

Asp Thr Gin Asn Ser Asp Asn He Glu Ser Leu Asp Val Ala Gly Leu 

225 230 235 240 

ttt gtt get ate ggt cac age ccg aat act gcg att ttc gaa ggg cag 

Phe Val Ala He Gly His Ser Pro Asn Thr Ala He Phe dlu Gly Gin 



48 



96 



144 



gaa gtg gaa aac tgg cct ggc gat cca aac gat ctg acc ggt ccg tta 192 
Glu Val Glu Asn Trp Pro Gly Asp Pro Asn Asp Leu Thr Gly Pro Leu 
50 55 60 



240 



288 



ctg aat ggc gat aac ggc gaa tac act tgc gac gcg ctg att att gee 33 6 
Leu Asn Gly Asp Asn Gly Glu Tyr Thr Cys Asp Ala Leu He He Ala 

100 105 HO 



384 



aaa ggc cgt ggg gtt tct get tgt gca acc tgc gac ggt ttc ttc tat 432 

Lys Gly Arg Gly Val Ser Ala Cys Ala Thr Cys Asp Gly Phe Phe Tyr 
130 135 140 

cgc aac cag aaa gtt gcg gtc ate ggc ggc ggc aat acc gcg gtt gaa 

Arg Asn Gin Lys Val Ala Val He Gly Gly Gly Asn Thr Ala Val Glu 

145 ^ 150 155 160 



480 



528 



cgc cgt gac ggt ttc cgc gcg gaa aaa ate etc att aag cgc ctg atg 576 
Arg Arg Asp Gly Phe Arg Ala Glu Lys He Leu He Lys Arg Leu Met 

180 ' 185 ISO 

gat aaa gtg gag aac ggc aac ate att ctg cac acc aac cgt acg ctg 
Asp Lys Val Glu Asn Gly Asn He He Leu His Thr Asn Arg Thr Leu 
195 200 205 



624 



gaa gaa gtg acc ggc gat caa atg ggt gtc act ggc gtt cgt ctg cgc 672 
Glu Glu Val Thr Gly Asp Gin Met Gly Val Thr Gly Val Arg Leu Arg 
210 ~ 215 220 



720 



768 
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245 250 255 

ctg gaa ctg gaa aac ggc tac ate aaa gta cag teg ggt att cat ggt 

Leu Glu Leu Glu Asn Gly Tyr He Lys Val Gin Ser Gly He His Gly 

260 265 270 

aat gec acc cag acc age att cct ggc gtc ttt gec gca ggc gac gtg 

Asn Ala Thr Gin Thr Ser lie Pro Gly Val Phe Ala Ala Gly Asp Val 

275 280 285 



atg gca gca ctt gat gcg gaa cgc tac etc gat ggt tta get gac gca 
Met Ala Ala Leu Asp Ala Glu Arg Tyr Leu Asp Gly Leu Ala Asp Ala 
305 310 315 320 

* 

aaa taa 
Lys * 



816 



864 



atg gat cac att tat cgc cag gec att act teg gec ggt aca ggc tgc 912 
Met Asp His He Tyr Arg Gin Ala He Thr Ser Ala Gly Thr Gly Cys 
290 ~ 295 300 



960 



966 



<210> 45 
<211> 321 
<212> PRT 
<213> E. coli 



<400> 45 
Met Gly Thr Thr 
1 

Ala Gly Tyr Thr 

20 

Val Leu He Thr 
35 

Glu Val Glu Asn 
50 

Leu Met Glu Arg 
65 

He Phe Asp His 

Leu Asn Gly Asp 

100 

Thr Gly Ala Ser 
115 

Lys Gly Arg Gly 
130 

Arg Asn Gin Lys 
145 

Glu Ala Leu Tyr 

Arg Arg Asp Gly 

180 

Asp Lys Val Glu 
195 

Glu Glu Val Thr 
210 

Asp Thr Gin Asn 
225 

Phe Val Ala He 

Leu Glu Leu Glu 

260 

Asn Ala Thr Gin 
275 

Met Asp His He 
290 



Lys His Ser Lys 
5 

Ala Ala Val Tyr 

Gly Met Glu Lys 

4 0 

Trp Pro Gly Asp 
55 

Met His Glu His 
70 

He Asn Lys Val 
85 

Asn Gly Glu Tyr 

Ala Arg Tyr Leu 

120 

Val Ser Ala Cys 
135 

Val Ala Val He 
150 

Leu Ser Asn He 
165 

Phe Arg Ala Glu 

Asn Gly Asn He 

200 

Gly Asp Gin Met 
215 

Ser Asp Asn He 
230 

Gly His Ser Pro 
245 

Asn Gly Tyr He 

Thr Ser He Pro 

280 

Tyr Arg Gin Ala 

295 



Leu Leu He Leu 
10 

Ala Ala Arg Ala 

25 

Gly Gly Gin Leu 

Pro Asn Asp Leu 

60 

Ala Thr Lys Phe 
75 

Asp Leu Gin Asn 
90 

Thr Cys Asp Ala 
105 

Gly Leu Pro Ser 

Ala Thr Cys Asp 

140 

Gly Gly Gly Asn 
155 

Ala Ser Glu Val 
170 

Lys He Leu He 
185 

He Leu His Thr 

Gly Val Thr Gly 

220 

Glu Ser Leu Asp 
235 

Asn Thr Ala He 
250 

Lys Val Gin Ser 
265 

Gly Val Phe Ala 

He Thr Ser Ala 

300 



Gly Ser Gly Pro 
15 

Asn Leu Gin Pro 
30 

Thr Thr Thr Thr 
45 

Thr Gly Pro Leu 

Glu Thr Glu He 

80 

Arg Pro Phe Arg 
95 

Leu He He Ala 
110 

Glu Glu Ala Phe 
125 

Gly Phe Phe Tyr 

Thr Ala Val Glu 

160 

His Leu He His 
175 

Lys Arg Leu Met 
190 

Asn Arg Thr Leu 
205 

Val Arg Leu Arg 

Val Ala Gly Leu 

240 

Phe Glu Gly Gin 
255 

Gly He His Gly 
270 

Ala Gly Asp Val 
285 

Gly Thr Gly Cys 
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Met Ala Ala Leu Asp Ala Glu Arg Tyr Leu Asp Gly Leu Ala Asp Ala 
305 310 315 320 

Lys 



<210> 46 

<211> 318 

<212> DNA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1) . . . (318) 

<400> 46 

atg gtg aag cag ate gag age aag act get ttt cag gaa gee ttg gac 

Met Val Lys Gin lie Glu Ser Lys Thr Ala Phe Gin Glu Ala Leu Asp 

15 10 15 

get gca ggt gat aaa ctt gta gta gtt gac ttc tea gee acg tgg tgt 
Ala Ala Gly Asp Lys Leu Val Val Val Asp Phe Ser Ala Thr Trp Cys 

20 ^ 25 30 



48 



96 



ggg cct tgc aaa atg ate aag cct ttc ttt cat tec etc tct gaa aag 144 
Gly Pro Cys Lys Met He Lys Pro Phe Phe His Ser Leu Ser Glu Lys 
35 40 45 

tat tec aac gtg ata ttc ctt gaa gta gat gtg gat gac tgt cag gat 192 
Tyr Ser Asn Val He Phe Leu Glu Val Asp Val Asp Asp Cys Gin Asp 
50 55 60 

gtt get tea gag tgt gaa gtc aaa tgc atg cca aca ttc cag ttt ttt 
Val Ala Ser Glu Cys Glu Val Lys Cys Met Pro Thr Phe Gin Phe Phe 
65 70 75 80 

aag aag gga caa aag gtg ggt gaa ttt tct gga gee aat aag gaa aag 
Lys Lys Gly Gin Lys Val Gly Glu Phe Ser Gly Ala Asn Lys Glu Lys 

85 90 95 



240 



288 



ctt gaa gee ace att aat gaa tta gtc taa 318 
Leu Glu Ala Thr He Asn Glu Leu Val * 

100 105 



<210> 47 

<211> 105 

<212> PRT 

<213> Homo Sapien 

<400> 47 



Met 


Val 


Lys 


Gin 


He 


Glu 


Ser 


Lys 


Thr 


Ala 


Phe 


Gin 


Glu 


Ala 


Leu 


Asp 


1 






5 










10 










15 




Ala 


Ala 


Gly 


Asp 


Lys 


Leu 


Val 


Val 


Val 


Asp 


Phe 


Ser 


Ala 


Thr 


Trp 


Cys 






20 








25 










30 






Gly 


Pro 


Cys 


Lys 


Met 


He 


Lys 


Pro 


Phe 


Phe 


His 


Ser 


Leu 


Ser 


Glu 


Lys 




35 








40 










45 








Tyr 


Ser 


Asn 


Val 


He 


Phe 


Leu 


Glu 


Val 


Asp 


Val 


Asp 


Asp 


Cys 


Gin 


Asp 


50 










55 










60 








Phe 


Val 


Ala 


Ser 


Glu 


Cys 


Glu 


Val 


Lys 


Cys 


Met 


Pro 


Thr 


Phe 


Gin 


Phe 


65 








70 










75 










80 


Lys 


Lys 


Gly 


Gin 


Lys 


Val 


Gly Glu Phe 


Ser 


Gly 


Ala 


Asn 


Lys 


Glu 


Lys 




85 










90 










95 




Leu 


Glu 


Ala 


Thr 
100 


He 


Asn 


Glu 


Leu 


Val 
105 
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<210> 48 

<211> 1494 

<212> DNA 

<213> Homo sapien 

<220> 

<221> CDS 

<222> (1) . . . (1494) 

<400> 48 ) 
atg aac ggc cct gaa gat ctt ccc aag tec tat gac tat gac ctt ate 
Met Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp Tyr Asp Leu lie 
1 5 10 15 

ate att gga ggt ggc tea gga ggt ctg gca get get aag gag cca gee 
lie He Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu Pro Ala 

20 25 30 

caa tat ggc aag aag gtg atg gtc ctg gac ttt ggc act ccc ace cct 
Gin Tyr Gly Lys Lys Val Met Val Leu Asp Phe Gly Thr Pro Thr Pro 
35 40 45 



48 



96 



144 



240 



ctt gga act aga tgg ggt ctt gga gga aca tgt gtg aat gtg ggt tgc 192 
Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys 
50 55 60 

ata cct aaa aaa ctg atg cat caa gca get ttg tta gga caa gee ctg 
He Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gin Ala Leu 
65 70 75 80 

caa gac tct cga aat tat gga tgg aaa gtc gag gag aca gtt aag cat 
Gin Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu Thr Val Lys His 

85 90 9S 



288 



gat tgg gac aga atg ata gaa get gta cag aat cac att ggc tct ttg 33 6 
Asp Trp Asp Arg Met lie Glu Ala Val Gin Asn His He Gly Ser Leu 

100 105 HO 

aat tgg ggc tac cga gta get ctg egg gag aaa aaa gtc gtc tat gag 3 84 
Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys Val Val Tyr Glu 
115 ^ 120 125 

aat get tat ggg caa ttt att ggt cct cac agg att aag gca aca aat 43 2 » 
Asn Ala Tyr Gly Gin Phe He Gly Pro His Arg He Lys Ala Thr Asn 
130 * 135 140 

aat aaa ggc aaa gaa aaa att tat tea gca gag aga ttt etc att gec 
Asn Lys Gly Lys Glu Lys He Tyr Ser Ala Glu Arg Phe Leu He Ala 
145 ^ ** 150 155 160 

act ggt gaa aga cca cgt tac ttg ggc ate cct ggt gac aaa gaa tac 
Thr Gly Glu Arg Pro Arg Tyr Leu Gly He Pro Gly Asp Lys Glu Tyr 

165 170 175 

tgc ate age agt gat gat ctt ttc tec ttg cct tac tgc ccg ggt aag 576 
Cys He Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr Cys Pro Gly Lys 

180 " 185 190 

aca ctg gtt gtt gga gca tec tat gtc get ttg gag tgc get gga ttt 
Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 
195 200 205 

ctt get ggt att ggt tta gac gtc act gtt atg gtt agg tec att ctt 
Leu Ala Gly He Gly Leu Asp Val Thr Val Met Val Arg Ser He Leu 
210 " ^ 215 220 

ctt aga gga ttt gac cag gac atg gec aac aaa att ggt gaa cac atg 



480 



528 



624 



672 



720 
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Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys He Gly Glu His Met 

225 ~ ' 230 235 240 

gaa gaa cat ggc ate aag ttt ata' aga cag ttc gta cca att aaa gtt 

Glu Glu His Gly He Lys Phe He Arg Gin Phe Val Pro He Lys Val 

245 250 255 

gaa caa att gaa gca ggg aca cca ggc cga etc aga gta gta get cag 

Glu Gin He Glu Ala Gly Thr Pro Gly Arg Leu Arg Val Val Ala Gin 

260 265 270 

tec ace aat agt gag gaa ate att gaa gga gaa tat aat acg gtg atg 

Ser Thr Asn Ser Glu Glu He He Glu Gly Glu Tyr Asn Thr Val Met 
275 280 285 

ctg gca ata gga aga gat get tgc aca aga aaa att ggc tta gaa ace 

Leu Ala He Gly Arg Asp Ala Cys Thr Arg Lys He Gly Leu Glu Thr 
290 295 300 

gta ggg gtg aag ata aat gaa aag act gga aaa ata cct gtc aca gat 

Val Gly Val Lys He Asn Glu Lys Thr Gly Lys He Pro Val Thr Asp 

305 310 315 320 

gaa gaa cag ace aat gtg cct tac ate tat gee att ggc gat ata ttg 
Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He Gly Asp He Leu 

325 330 335 



ctg get cag agg etc tat gca ggt tec act gtc aag tgt gac tat gaa 
Leu Ala Gin Arg Leu Tyr Ala Gly Ser Thr Val Lys Cys Asp Tyr Glu 
355 360 365 



ctt tct gag gag aaa get gtg gag aag ttt ggg gaa gaa aat att gag 

Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu Glu Asn He Glu 

385 390 395 400 

gtt tac cat agt tac ttt tgg cca . ttg gaa tgg acg att ccg tea aga 

Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr He Pro Ser Arg 

405 410 415 

gat aac aac aaa tgt tat gca aaa ata ate tgt aat act aaa gac aat 
Asp Asn Asn Lys Cys Tyr Ala Lys He lie Cys Asn Thr Lys Asp Asn 

420 425 430 



ctg gac age aca att gga ate cac cct gtc tgt gca gag gta ttc aca 

Leu Asp Ser Thr He Gly He His Pro Val Cys Ala Glu Val Phe Thr 

465 470 475 480 

aca ttg tct gtg ace aag cgc tct ggg gca age ate etc cag get ggc 

Thr Leu Ser Val Thr Lys Arg Ser Gly Ala Ser He Leu Gin Ala Gly 

485 ~ 490 495 



768 



816 



864 



912 



960 



1008 



gag gat aag gtg gag etc acc cca gtt gca ate cag gca gga aga ttg 1056 
Glu Asp Lys Val Glu Leu Thr Pro Val Ala He Gin Ala Gly Arg Leu 

340 345 350 



1104 



aat gtt cca acc act gta ttt act cct ttg gaa tat ggt get tgt ggc 1152 
Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Ala Cys Gly 
370 375 380 



1200 



1248 



1296 



gaa cgt gtt gtg ggc ttt cac gta ctg ggt cca aat get gga gaa gtt 13 44 
Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn Ala Gly Glu Val 
435 440 445 

aca caa ggc ttt gca get gcg etc aaa tgt gga ctg acc aaa aag cag 13 92 
Thr Gin Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu Thr Lys Lys Gin 
450 * 455 460 



1440 



1488 
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tgc tga 1494 
Cys * 



<210> 49 

<211> 497 

<212> PRT 

<213> Homo sapien 

<400> 49 . 

Met Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp Tyr Asp Leu lie 

15 10 15 

He He Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu Pro Ala 

20 25 30 

Gin Tyr Gly Lys Lys Val Met Val Leu Asp Phe Gly Thr Pro Thr Pro 

35 " 40 45 

Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val Asn Val Gly Cys 

50 55 60 

He Pro Lys Lys Leu Met His Gin Ala Ala Leu Leu Gly Gin Ala Leu 
65 70 75 80 

Gin Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu Thr Val Lys His 

85 90 95 

Asp Trp Asp Arg Met He Glu Ala Val Gin Asn His He Gly Ser Leu 

100 105 HO 

Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys Val Val Tyr Glu 

115 120 125 

Asn Ala Tyr Gly Gin Phe He Gly Pro His Arg He Lys Ala Thr Asn 

130 * 135 14'0 

Asn Lys Gly Lys Glu Lys He Tyr Ser Ala Glu Arg Phe Leu He Ala 
145 150 155 160 

Thr Gly Glu Arg Pro Arg Tyr Leu Gly He Pro Gly Asp Lys Glu Tyr 

165 170 175 

Cys He Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr Cys Pro Gly Lys 

180 ~ " 185 190 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ala Gly Phe 

195 200 205 

Leu Ala Gly He Gly Leu Asp Val Thr Val Met Val Arg Ser He Leu 

210 215 220 

Leu Arg Gly Phe Asp Gin Asp Met Ala Asn Lys He Gly Glu His Met 
225 230 235 240 

Glu Glu His Gly He Lys Phe He Arg Gin Phe Val Pro He Lys Val 

245 250 255 

Glu Gin He Glu Ala Gly Thr Pro Gly Arg Leu Arg Val Val Ala Gin 

260 265 270 

Ser Thr Asn Ser Glu Glu He He Glu Gly Glu Tyr Asn Thr Val Met 

275 280 285 

Leu Ala He Gly Arg Asp Ala Cys Thr Arg Lys He Gly Leu Glu Thr 

290 ' 295 300 

Val Gly Val Lys He Asn Glu Lys Thr Gly Lys He Pro Val Thr Asp 
305 310 ~ 315 320 

Glu Glu Gin Thr Asn Val Pro Tyr He Tyr Ala He Gly Asp He Leu 

325 330 335 

Glu Asp Lys Val Glu Leu Thr Pro Val Ala He Gin Ala Gly Arg Leu 

340 345 350 

Leu Ala Gin Arg Leu Tyr Ala Gly Ser Thr Val Lys Cys Asp Tyr Glu 

355 360 365 

Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr Gly Ala Cys Gly 

370 375 380 

Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu Glu Asn He Glu 
385 390 395 400 

Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr He Pro Ser Arg 

405 410 415 

Asp Asn Asn Lys Cys Tyr Ala Lys He He Cys Asn Thr Lys Asp Asn 

420 425 430 

Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn Ala Gly Glu Val 
435 440 445 
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Thr Gin Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu Thr Lys Lys Gin 

450 455 460 

Leu Asp Ser Thr lie Gly He His Pro Val Cys Ala Glu Val Phe Thr 
465 470 475 480 

Thr Leu Ser Val Thr Lys Arg Ser Gly Ala Ser He Leu Gin Ala Gly 

485 490 495 

Cys 



<210> 50 
<211> 1377 
<212> DNA 

<213> Mycobacterium leprae 

<220> 

<221> CDS 

<222> (1) . . . (1377) 

<400> 50 

atg aac acc act cct tct gcg cat gag acg ata cac gaa gtg ate gtt 
Met Asn Thr Thr Pro Ser Ala His Glu Thr He His Glu Val He Val 
1 5 10 15 

att ggc tec ggt cca gca ggc tac act get gec ctg tac gec get cgt 
He Gly Ser Gly Pro Ala Gly Tyr Thr Ala Ala Leu Tyr Ala Ala Arg 

20 * 25 30 



ata acc ggc ccg gag ttg atg gac gat atg cgt gaa cag gca ctg cga 
He Thr Gly Pro Glu Leu Met Asp Asp Met Arg Glu Gin Ala Leu Arg 
65 70 75 80 

ttc ggc gcg gaa ctg egg acc gaa gac gtc gag teg gta tea ttg cgt 
Phe Gly Ala Glu Leu Arg Thr Glu Asp Val Glu Ser Val Ser Leu Arg 

85 90 95 



48 



96 



gca cag eta aca ccg ctg gta ttt gag ggt acc tea ttc ggc ggc gcg 144 

Ala Gin Leu Thr Pro Leu Val Phe Glu Gly Thr Ser Phe Gly Gly Ala 
35 40 45 

ctg atg acc acc acc gag gtg gaa aac tac cca ggt ttt cgc aac ggc 192 

Leu Met Thr Thr Thr Glu Val Glu Asn Tyr Pro Gly Phe Arg Asn Gly 
50 55 60 



240 



288 



ggc ccg ate aaa teg gtc gtc acc get gaa gga cag act tat cag gec 336 
Gly Pro He Lys Ser Val Val Thr Ala Glu Gly Gin Thr Tyr Gin Ala 

100 105 HO 

cga gec gtc ate etc gec atg ggt acc tec gtg cgt tat eta cag ate 384 
Arg Ala Val He Leu Ala Met Gly Thr Ser Val Arg Tyr Leu Gin He 
115 120 125 

ccc ggc gag caa gaa ttg eta gga cgt ggc gtg agt gca tgc gcg acc 432 
Pro Gly Glu Gin Glu Leu Leu Gly Arg Gly Val Ser Ala Cys Ala Thr 
130 135 140 

tgc gac ggg tec ttt ttc cgc ggc caa gac att gee gtc att ggc ggt 
Cys Asp Gly Ser Phe Phe Arg Gly Gin Asp lie Ala Val He Gly Gly 
145 150 ~ 155 160 

gga gac tea gcg atg gag gaa gec etc ttt ttg acc egg ttc gec cgc 
Glv Asp Ser Ala Met Glu Glu Ala Leu Phe Leu Thr Arg Phe Ala Arg 

165 170 175 

age gtc acg etc gtg cac cgc cgc gac gaa ttc cga get tct aag ate 
Ser Val Thr Leu Val His Arg Arg Asp Glu Phe Arg Ala Ser Lys He 

180 185 190 



480 



528 



576 
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atg etc ggt cgc gee cgt aac aat gac aag ate aaa ttc ate ace aac 624 
Met Leu Gly Arg Ala Arg Asn Asn Asp Lys lie Lys Phe lie Thr Asn 
195 200 205 

cac ace gtg gtc gcg gtg aac ggg tat aca aca gtg acc gga ttg egg 672 
His Thr Val Val Ala Val Asn Gly Tyr Thr Thr Val Thr Gly Leu Arg 
210 215 220 

ttg cgt aac acc aca acg gga gag gaa acc acg eta gta gtg acc ggg 720 
Leu Arg Asn Thr Thr Thr Gly Glu Glu Thr Thr Leu Val Val Thr Gly 
225 230 * 235 240 

gtt ttt gtt gca att ggc cat gaa cca cgt tec age ctg gtg age gat 768 
Val Phe Val Ala lie Gly His Glu Pro Arg Ser Ser Leu Val Ser Asp 

245 250 255 

gtc gtc gac ata gac ccg gat ggc tac gtc ctg gtg aaa gga cgt acg 816 
Val Val Asp lie Asp Pro Asp Gly Tyr Val Leu Val Lys Gly Arg Thr 

260 " " 265 270 

acg agt aca teg atg gac ggc gtt ttt gcg gec ggc gac ctg gta gat 8 64 
Thr Ser Thr Ser Met Asp Gly Val Phe Ala Ala Gly Asp Leu Val Asp 
275 280 285 

cgc acc tac egg cag gcg ate act gee gca ggt agt ggc tgt gee gec 912 
Arg Thr Tyr Arg Gin Ala lie Thr Ala Ala Gly Ser Gly Cys Ala Ala 
290 " 295 300 

gee ate gac gee gaa cgt tgg ttg gcg gag cat gee ggg tea aaa get 960 
Ala lie Asp Ala Glu Arg Trp Leu Ala Glu His Ala Gly Ser Lys Ala 
305 310 315 320 

aac gaa aca aca gag gaa act gga gac gtt gac agt acc gac aca acc 100 8 
Asn Glu Thr Thr Glu Glu Thr Gly Asp Val Asp Ser Thr Asp Thr Thr 

325 330 335 

gat tgg age act gcg atg act gac gec aag aac gec ggg gtc aca ata 1056 
Asp Trp Ser Thr Ala Met Thr Asp Ala Lys Asn Ala Gly Val Thr lie 

340 ' 345 350 

gaa gtg acc gat get tec ttt ttc gca gac gtc tta tec agt aat aag 1104 
Glu Val Thr Asp Ala Ser Phe Phe Ala Asp Val Leu Ser Ser Asn Lys 
355 360 365 

cct gtg tta gtt gat ttt tgg gca aca tgg tgt gga ccc tgc aag atg 1152 
Pro Val Leu Val Asp Phe Trp Ala Thr Trp Cys Gly Pro Cys Lys Met 
370 375 380 

gta gcg ccg gta etc gaa gag ate gcg tec gaa caa cga aac cag etc 1200 
Val Ala Pro Val Leu Glu Glu lie Ala Ser Glu Gin Arg Asn Gin Leu 
385 390 395 400 

act gtc gee aag tta gat gta gac acc aac ccg gaa atg gca cgc gag 124 B 
Thr Val Ala Lys Leu Asp Val Asp Thr Asn Pro Glu Met Ala Arg Glu 

405 410 415 

ttc cag gtc gtg teg ata ccc aca atg att ctg ttc cag ggt ggc caa 1296 
Phe Gin Val Val Ser lie Pro Thr Met He Leu Phe Gin Gly Gly Gin 

420 425 430 

cca gta aaa cgc ate gtt ggc get aag ggc aaa gca gcg tta eta cgt 1344 
Pro Val Lys Arg He Val Gly Ala Lys Gly Lys Ala Ala Leu Leu Arg 
435 440 445 

gac ctt tec gac gtg gta cct aac etc aat tag 1377 
Asp Leu Ser Asp Val Val Pro Asn Leu Asn * 
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450 455 

<210> 51 
<211> 458 
<212> PRT 

<213> Mycobacterium leprae 

Met^Asn^Thr Thr Pro Ser Ala His Glu Thr He His Glu Val lie Val 

15 10 15 

He Gly Ser Gly Pro Ala Gly Tyr Thr Ala Ala Leu Tyr Ala Ala Arg 

20 25 30 

Ala Gin Leu Thr Pro Leu Val Phe Glu Gly Thr Ser Phe Gly Gly Ala 

35 40 45 

Leu Met Thr Thr Thr Glu Val Glu Asn Tyr Pro Gly Phe Arg Asn Gly 

50 55 60 

He Thr Gly Pro Glu Leu Met Asp Asp Met Arg Glu Gin Ala Leu Arg 
65 70 75 80 

Phe Gly Ala Glu Leu Arg Thr Glu Asp Val Glu Ser Val Ser Leu Arg 

85 90 95 

Gly Pro He Lys Ser Val Val Thr Ala Glu Gly Gin Thr Tyr Gin Ala 

100 105 H° 

Arg Ala Val He Leu Ala Met Gly Thr Ser Val Arg Tyr Leu Gin He 

115 120 125 

Pro Gly Glu Gin Glu Leu Leu Gly Arg Gly Val Ser Ala Cys Ala Thr 

130 135 140 

Cys Asp Gly Ser Phe Phe Arg Gly Gin Asp He Ala Val He Gly Gly 
145 150 155 160 

Gly Asp Ser Ala Met Glu Glu Ala Leu Phe Leu Thr Arg Phe Ala Arg 

165 170 175 

Ser Val Thr Leu Val His Arg Arg Asp Glu Phe Arg Ala Ser Lys He 

180 185 190 

Met Leu Gly Arg Ala Arg Asn Asn Asp Lys He Lys Phe He Thr Asn 

195 200 205 

His Thr Val Val Ala Val Asn Gly Tyr Thr Thr Val Thr Gly Leu Arg 

210 215 220 

Leu Arg Asn Thr Thr Thr Gly Glu Glu Thr Thr Leu Val Val Thr Gly 
225 230 235 240 

Val Phe Val Ala He Gly His Glu Pro Arg Ser Ser Leu Val Ser Asp 

245 " 250 255 

Val Val Asp He Asp Pro Asp Gly Tyr Val Leu Val Lys Gly Arg Thr 

260 ~ 265 270 

Thr Ser Thr Ser Met Asp Gly Val Phe Ala Ala Gly Asp Leu Val Asp 

275 280 285 

Arg Thr Tyr Arg Gin Ala He Thr Ala Ala Gly Ser Gly Cys Ala Ala 

290 ^ 295 300 

Ala He Asp Ala Glu Arg Trp Leu Ala Glu His Ala Gly Ser Lys Ala 
305 310 315 320 

Asn Glu Thr Thr Glu Glu Thr Gly Asp Val Asp Ser Thr Asp Thr Thr 

325 330 335 

Asp Trp Ser Thr Ala Met Thr Asp Ala Lys Asn Ala Gly Val Thr He. 

340 345 350 

Glu Val Thr Asp Ala Ser Phe Phe Ala Asp Val Leu Ser Ser Asn Lys 

355 360 365 

Pro Val Leu Val Asp Phe Trp Ala Thr Trp Cys Gly Pro Cys Lys Met 

370 * 375 380 

Val Ala Pro Val Leu Glu Glu He Ala Ser Glu Gin Arg Asn Gin Leu 
385 390 395 400 

Thr Val Ala Lys Leu Asp Val Asp Thr Asn Pro Glu Met Ala Arg Glu 

405 410 415 

Phe Gin Val Val Ser He Pro Thr Met He Leu Phe Gin Gly Gly Gin 

420 425 430 

Pro Val Lys Arg He Val Gly Ala Lys Gly Lys Ala Ala Leu Leu Arg 

435 440 445 

Asp Leu Ser Asp Val Val Pro Asn Leu Asn 
450 455 
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<210> 52 
<211> 178 
<212> PRT 

<213> Arabidopsia thaliana 
<400> 52 

Met Pro Leu Ser Leu Arg Leu Ser Pro Ser Pro Thr Ala Leu Ser Pro 

1 5 10 15 

Thr Thr Gly Gly 'Phe Gly Pro Ser Arg Lys Gin Cys Arg lie Pro Tyr 

20 25 30 

Ser Gly Val Pro Thr Thr Lys He Gly Phe Cys Ser Leu Asp Ser Arg 

35 40 45 

Lys Arg Gly Asp Ser Ser Val Val Arg Cys Ser Leu Glu Thr Val Asn 

50 ^ 55 60 

Val Ser Val Gly Gin Val Thr Glu Val Asp Lys Asp Thr Phe Trp Pro 
65 70 75 80 

He Val Lys Ala Ala Gly Glu Lys Leu Val Val Leu Asp Met Tyr Thr 

85 90 95 

Gin Trp Cys Gly Pro Cys Lys Val He Ala Pro Lys Tyr Lys Ala Leu 

100 105 HO 

Ser Glu Lys Tyr Asp Asp Val Val Phe Leu Lys Leu Asp Cys Asn Pro 

115 " * 120 125 

Asp Asn Arg Pro Leu Pro Lys Glu Leu Gly He Arg Val Val Pro Thr 

• 130 135 I 40 

Phe Lys He Leu Lys Asp Asn Lys Val Val Lys Glu Val Thr Gly Ala 
145 150 155 160 

Lys Tyr Asp Asp Leu Val Ala Ala He Glu Thr Ala Arg Ser Ala Ala 

165 170 175 

Ser Gly 



<210> 53 
<211> 185 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 53 
Met Pro Leu Ser 
1 

Ser Pro He Thr 

20 

His Cys Arg He 
35 

Ser Gly Gly Gly 
50 

Val Val Arg Cys 
65 

Thr Glu Val Asp 

Asp Lys He Val 

100 

Lys Val He Ala 
115 

Met Val Phe Leu 
130 

Lys Glu Leu Gly 
145 

Asn Lys Val Val 

Ala Ala He Glu 

180 



Leu Arg Leu Ala 
5 

Ser Thr Gly Ala 

Pro Asn Ser Gly 

40 

Gly Val Leu Asp 
55 

Ser Leu Glu Thr 
70 

Lys Asp Thr Phe 
85 

Val Leu Asp Met 

Pro Lys Tyr Lys 

120 

Lys Leu Asp Cys 
135 

He Arg Val Val 
150 

Lys Glu Val Thr 
165 

Ala Ala Arg Ser 



Pro Ser Pro Thr 
10 

Gly Gly Phe Ser 
25 

Val Ala Thr Lys 

Ser Gly Arg Arg 

60 

Val Asn Val Thr 
75 

Trp Pro He Val 
90 

Tyr Thr Gin Trp 
105 

Glu Leu Ser Glu 

Asn Gin Asp Asn 

14 0 

Pro Thr Phe Lys 
155 

Gly Ala Lys Tyr 
170 

Gly 
185 



Ser Phe Arg Tyr 
15 

Pro Val Lys Gin 
30 

He Gly Phe Cys 
45 

He Gly Ser Cys 

Val Gly Gin Val 

80 

Lys Ala Ala Gly 
95 

Cys Gly Pro Cys 
110 

Lys Tyr Gin Asp 
125 

Lys Pro Leu Ala 

He Leu Lys Asp 

160 

Glu Asp Leu Leu 
175 



<210> 54 
<211> 182 
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<212> PRT 

<213> Brassica napus 
<400> 54 





xr x y~> 




Rf>r 

*o t^. -*<- 


Leu 


Ara; 


Leu 


Ala 


Pro 


Ser 


Pro 


Thr 


Ala 


Leu 


Ser 


Pro 


1 








5 








10 










15 




T'Vi v 


J. liX 


uxy 




Php 

<c lie; 


Gpr 


Pro 


Ala 


Lys 


LVS 


Gin 


Cys 


Arg 


He 


Pro 


Ser 






20 










25 










30 






xyr 


OCX 




Val 


Ala 


Thr 


Thr 


Thr 


Arg 


Ara; 


He 


Gly 


Leu 


Cys 


Ser 


Leu 




35 










40 










45 






Gin 




Tut 


Val 

V a, J- 


JiVS 


Ara 


Gly 


Asp 


Ser 


Ser 


Val 


Val 


Arg 


Cys 


Ser 


Leu 


50 










55 










60 








Thr 


TVi t~ 

J. ilx 






Val 




Val 


Glv 


Gin 


Val 


Thr 


Glu 


Val 


Asp 


Lys 


Asp 


65 










70 








75 






Val 




80 


Phe 


Trp 


Pro 


lie 


Val 


Lys 


Ala 


Ala 


Gly 


Glu 


Lys 


He 


Val 


Leu 


Asp 








85 








90 










95 




Met 


Tyr 


Thr 


Gin 


Trp 


Cys 


Gly 


Pro 


Cys 


Lys 


Val 


He 


Ala 


Pro 


Lys 


Tyr 






100 








105 










110 






Lys 


Ala 


Leu 


Ser 


Glu 


Lys 


Tyr 


Glu Asp 


Val 


Val 


Phe 


Leu 


Lys 


Leu 


Asp 




115 










120 










125 






Val 


Cys 


Asn 


Pro 


Glu 


Asn 


Arg 


Pro 


Leu 


Ala 


Lys 


Glu 


Leu Gly 


He 


Arg 


130 








135 










14 0 








Val 


Val 


Pro 


Thr 


Phe 


Lys 


He 


Leu 


Lys 


Asp 


Asn 


Gin 


Val 


Val 


Lys 


Glu 


145 








150 










155 






Thr 


Ala 


160 


Thr 


Gly 


Ala 


Lys 


Tyr 


Asp 


Asp 


Leu 


Val 


Ala 


Ala 


lie 


Glu 


Arg 






165 










170 










175 




Ser 


Ala 


Ser 


Ser 
180 


Ser 


Gly 























<210> 55 
<211> 191 
<212> PRT 

<213> Mesembryanthemum crystallinum 



<400> 55 




















Ala 






Leu 


Met 


Ala 


Met 


Gin 


Leu 


Ser 


Leu 


Ser 


His 


Gin 


Ser 


Trp 


Lys 


Ser 


1 








5 










10 










15 




Ala 


Ser 


Pro 


He 


Thr 


Ser 


Phe 


Asp 


Pro 


Ala 


Arg 


Ser 


Pro 


Pro 


Lys 


Arg 








20 










25 










30 






Val 


Glu 


Leu 


Gly 


Pro 


Asn 


Cys 


Leu 


Asn 


Gly 


Gly 


Ala 


Thr 


Ala 


Gly 


Lys 






35 








40 










45 






Ser 


Leu 


Met 


Arg 


Glu 


Lys 


Val 


Gly Glu Arg 


Met 


Arg 


Met 


Ser 


Gly Arg 




50 








55 










60 








Glu 


Cys 


Cys 


Val 


Lys 


Ala 


Ser 


Leu 


Glu 


Thr 


Ala 


Val 


Gly 


Ala 


Glu 


Ser 


65 






70 










75 










80 


Thr 


Leu 


Val 


Gly 


Lys 


Val 


Thr 


Glu 


Val 


Asp 


Lys 


Asp 


Thr 


Phe 


Trp 


Pro 








85 










90 










95 


Thr 


He 


Ala 


Asn Gly 


Ala Gly 


Asp 


Lys 


Pro 


Val 


Val 


Leu 


Asp 


Met 


Tyr 








100 










105 










110 


Glu 




Gin 


Trp 


Cys 


Gly 


Pro 


Cys 


Lys 


Val 


Met 


Ala 


Pro 


Lys 


Tyr 


Gin 


Leu 




115 








120 










125 






Gin 


Ala 


Glu 


Lys 


Leu 


Leu 


Asp 


Val 


Val 


Phe 


Leu 


Lys 


Leu 


Asp 


Cys 


Asn 




130 








135 










14 0 




Val 




Thr 


Glu 


Asn 


Lys 


Pro 


Leu 


Ala 


Lys 


Glu 


Leu 


Gly 


He 


Arg 


Val 


Pro 


145 








150 








155 










160 


Phe 


Lys 


He 


Leu 


Lys 


Gly 


Gly Lys 


He 


Val 


Asp 


Glu 


Val 


Thr 


Gly 


Ala 








165 










170 










175 




Lys 


Phe 


Asp 


Lys 


Leu 


Val 


Ala 


Ala 


He 


Glu 


Ala 


Ala 


Arg 


Ser 


Ser 






180 










185 










190 







<210> 56 
<211> 182 
<212> PRT 

<213> Pisum sativum 
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<400> 56 




Met 


Ala 


Leu 


Asn 


1 








Phe 


Asp 


Ser 


Ala 








20 


Thr 


Thr 


Ser 


- - — 

Phe 






35 




Gin 


Arg 


Leu 


Ser 




50 






Leu 


Glu 


Thr 


Ala 


65 








Asn 


Lys 


Asp 


Thr 


Val 


Val 


Leu 


Asp 








100 


Ala 


Pro 


Leu 


Tyr 






115 




Leu 


Lys 


Leu 


Asp 




13 0 






Gly 


lie 


Lys 


Val 


145 








Val 


Lys 


Glu 


Val 


Asp 


Thr 


Val 


Arg 








180 



Leu Cys Thr Ser 
5 

Ser Ser Ser Lys 

Ser Ser Ser lie 

40 

Leu Arg Arg Ser 
55 

Gly Pro Thr Val 
70 

Phe Trp Pro lie 
85 

Met Phe Thr Lys 

Glu Glu Leu Ser 

120 

Cys Asn Gin Asp 
13 5 

Val Pro Thr Phe 
150 

Thr Gly Ala Lys 
165 

Ser Ser 



Pro Lys Trp lie 
10 

Pro Ser Leu Ala 
25 

Leu Cys Ser Lys 

lie Ser Val Ser 

60 

Thr Val Gly Lys 
75 

Val Asn Ala Ala 
90 

Trp Cys Gly Pro 
105 

Gin Lys Tyr Leu 

Asn Lys Ser Leu 

140 

Lys lie Leu Lys 
155 

Phe Asp Asp Leu 
170 



Gly Thr Thr Val 
15 

Ser Ser Phe Ser 
30 

Arg Val Gly Leu 
45 

Val Arg Ser Ser 

Val Thr Glu Val 

80 

Gly Asp Lys Thr 
95 

Cys Lys Val lie 
110 

Asp Val Val Phe 
125 

Ala Lys Glu Leu 

Asp Asn Lys lie 

16 0 

Val Ala Ala lie 
175 



<210> 57 
<211> 190 
<212> PRT 

<213> Spinacia oleracea 



<400> 57 




Met 


Ala 


Leu 


His 


1 








His 


Pro 


He 


Thr 








20 


Gly 


Leu 


Ser 


Arg 






35 




Val 


Phe 


Val 


Val 




50 






Val 


Arg 


Ala 


Ser 


65 








lie 


Val 


Gly 


Lys 


Val 


Lys 


Ala 


Ala 








100 


Trp 


Cys 


Gly 


Pro 






115 




Glu 


Glu 


Tyr 


Leu 




130 






Asn 


Lys 


Thr 


Leu 


145 








Lys 


lie 


Leu 


Lys 


Tyr 


Asp 


Lys 


Leu 








180 



Leu Ser Leu Ser 
5 

Ser Ser Asp Pro 

Arg Val Asp Phe 

40 

Lys Arg Lys Asp 
55 

Met Glu Gin Ala 
70 

Val Thr Glu Val 
85 

Gly Asp Lys Pro 

Cys Lys Ala Met 

12 0 

Asp Val He Phe 
135 

Ala Lys Glu Leu 
150 

Glu Asn Ser Val 
165 

Leu Glu Ala He 



His Gin Ser Trp 
10 

Thr Arg Ser Ser 
25 

Leu Gly Ser Cys 

Arg Arg Arg Met 

60 

Leu Gly Thr Gin 
75 

Asn Lys Asp Thr 
90 

Val Val Leu Asp 
105 

Ala Pro Lys Tyr 

Leu Lys Leu Asp 

140 

Gly He Arg Val 
155 

Val Gly Glu Val 
170 

Gin Ala Ala Arg 
185 



Thr Ser Pro Ala 
15 

Val Pro Gly Thr 
3 0 

Lys He Asn Gly 
45 

Arg Gly Gly Glu 

Glu Met Glu Ala 

80 

Phe Trp Pro He 
95 

Met Phe Thr Gin 
110 

Glu Lys Leu Ala 
125 

Cys Asn Gin Glu 

Val Pro Thr Phe 

160 

Thr Gly Ala Lys 
175 

Ser Ser 
190 



<210> 58 
<211> 106 
<212> PRT 
<213> Anabaena 



<400> 58 

Ser Ala Ala Ala Gin Val Thr Asp Ser Thr Phe Lys Gin Glu Val Leu 
15 10 15 
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Asp 


Ser 


Asp 


Val 


Pro 


Val 


Leu 


Val 


Asp 








20 










25 


Pro 


Cys 


Arg 


Met 


Val 


Ala 


Pro 


Val 


Val 




35 










40 




Glu 


Gly 


Lys 


lie 


Lys 


Val 


Val 


Lys 


Val 




50 










55 






Val 


Ala 


Ser 


Gin 


Tyr Gly 


lie 


Arg 


Ser 


65 










70 








Lys 


Gly 


Gly 


Gin 


Lys 


Val 


Asp '$4et 


Val 










85 










Thr 


Lqu 


Ser 


Gin 


Thr 


Leu 


Glu 


Lys 


His 




\ 




100 










105 



<210> 59 

<211> 179 

<212> PRT 

<213> Arabidopsis thai i ana 

<400> 59 



Met 


Ala 


Ala 


Tyr 


Thr 


Cys 


Thr 


Ser 


Arg 


1 








5 










Glu 


Met 


Arg 


He 


Ala 


Ser 


Ser 


Pro 


Thr 








20 










25 


Met 


Phe 


Ser 


Val 


Leu 


Pro 


Glu 


Ser 


Ser 






35 










40 




Leu 


Ser 


Ser 


Leu 


Ser 


Lys 


Asn 


Ser 


Arg 




50 










55 






Val 


He 


• 

Cys 


Glu 


Ala 


Gin 


Asp 


Thr 


Ala 


65 










70 








Asp 


Ser 


Thr 


Trp 


Asp 


Ser 


Leu 


Val 


Leu 










85 










Val 


Asp 


Phe 


Trp 


Ala 


Pro 


Trp 


Cys 


Gly 








100 










105 


lie 


Val 


Asn 


Glu 


Leu 


Ala 


Gin 


Lys 


Tyr 






115 










120 




Lys 


Leu 


Asn 


Thr 


Asp 


Glu 


Ser 


Pro 


Ala 


130 










135 






Arg 


Ser 


lie 


Pro 


Thr 


He 


Met 


He 


Phe 


145 










150 








Thr 


He 


He 


Gly 


Ala 


Val 


Ser 


Lys 


Asp 



165 



Lys Phe Leu 



<210> 60 
<211> 186 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 60 



Met 


Ala 


Ala 


Phe 


Thr 


Cys 


Thr 


Ser 


Arg 


1 








5 










Glu 


Thr 


Arg 


He 


Val 


Ser 


Ser 


Ser 


Pro 








20 










25 


Arg 


Arg 


Met 


Phe 


Ala 


Val 


Leu 


Pro 


Glu 




35 










40 




Leu 


Ser 


Leu 


Ser 


Pro 


Ala 


Ser 


Leu 


Thr 




50 










55 






Ser 


Arg 


Leu 


Arg 


Arg 


Ala 


Val 


Val 


Cys 


65 










70 








Asp 


He 


Gin 


Val 


Val 


Asn Asp 


Ser 


Thr 








85 










Ala 


Thr 


Gly 


Pro 


Val 


Val 


Val 


Asp 


Phe 






100 










105 


Cys 


Lys 


Met 


He 


Asp 


Pro 


Leu 


Val 


Asn 



PCT/US01/50240 



Phe 


Trp 


Ala 


Pro 


Trp 


Cys 


Gly 










30 






Asp 


Glu 


He 


Ala 


Gin 


Gin 


Tyr 








45 








Asn 


Thr 


Asp 


Glu 


Asn 


Pro 


Gin 






60 










He 


Pro 


Thr 


Leu 


Met 


He 


Phe 




75 










80 


Val 


Gly 


Ala 


Val 


Pro 


Lys 


Thr 


90 










95 




DC U. 














Pro 


Pro 


He 


Ser 


He 


Arci 


Ser 


10 










15 




Glv 


u w J- 


Phe 


Ser 


Thr 


Ara 


Gin 










30 




Glv 




Arn 


Thr 


Arn 


Val 


Ser 








45 








Val 


Ser 


Arg 


Leu 

^ ri H V-*- 


Ara 


Arg 


Gly 






60 










Thr 


Glv 


He 


Pro 


Val 

V >> — *. -I— 


Val 


Asn 




75 










80 


Lys 


Ala 


Asp 


Glu 


Pro 


Val 


Phe 


90 










95 




Pro 


Cys 


Lys 


Met 


He 


Asp 


Pro 










110 






Ala 


Gly 


Gin 


Phe 


Lys 


Phe 


Tyr 








12 5 








Thr 


Pro 


Gly 


Gin 


Tyr 


Gly 


Val 






140 










Val 


Asn 


Gly 


Glu 


Lys 


Lys 


Asp 




155 










160 


Thr 


Leu 


Ala 


Thr 


Ser 


He 


Asn 


170 










175 




Pro 


Pro 


He 


Ser 


Leu 

ill J 


Arg 


Ser 


10 










15 






Ala 


Rpt 


Ser 

h—* W« J— 


Leu 


Ser 


Ser 










30 






Ser 


Ser 


Gly 


Leu 


Arg 


He 


Arg 








45 








Ser 


He 


His 


Gin 


Pro 


Arg 


Val 






60 










Glu 


Ala 


Gin 


Glu 


Thr 


Thr 


Thr 




75 










80 


Trp 


Asp 


Ser 


Leu 


Val 


Leu 


Lys 


90 










95 




Trp 


Ala 


Pro 


Trp 


Cys 


Gly 


Pro 










110 






Asp 


Leu 


Ala 


Gin 


His 


Tyr 


Thr 



-55- 



WO 02/50289 



PCT/US01/50240 



115 

Gly Lys lie Lys 
130 

Pro Gly Gin Tyr 
145 

Gly Gly Glu Lys 

Leu Thr Ser Ser 

180 



120 

Phe Tyr Lys Leu 
135 

Gly Val Arg Ser 
150 

Lys Asp Thr He 
165 

Leu Asp Lys Phe 



Asn Thr Asp Glu 

140 

He Pro Thr He 
155 

He Gly Ala Val 

170 
Leu Pro 
185 



125 

Ser Pro Asn Thr 

Met He Phe Val 

160 

Pro Lys Thr Thr 
17 5 



<210> 61 
<211> 173 
<212> PRT 

<213> Arabidopsis thai i ana 



<400> 61 



Met 


Ala 


He 


Ser 


Ser 


Ser 


Ser 


Ser 


1 








5 








Phe 


His 


Thr 


Ala 


Arg 


His 


He 


Ser 








20 










Thr 


Ser 


Phe 


Ser 


Pro 


Arg 


Ser 


Leu 






35 










40 


Leu 


Ser 


Ser 


Ser 


Ala 


Ser 


Arg 


Leu 




50 










55 




Asp 


Ser 


Arg 


Ala 


Ala 


Glu 


Val 


Thr 


65 










70 






Leu 


Lys 


Ser 


Glu 


Thr 


Pro 


Val 


Leu 








85 








Gly 


Pro 


Cys 


Arg 


Met 


Val 


His 


Arg 






100 










Tyr 


Ala 


Gly 


Lys 


Leu 


Asn 


Cys 


Tyr 




115 










120 


Pro 


Val 


Ala 


Glu 


Glu 


Tyr 


Glu 


He 




130 










135 




Phe 


Lys 


Asn 


Gly 


Glu 


Lys 


Arg 


Glu 


145 








150 






Glu 


Phe 


Tyr 


He 


Ser 


Ala 


He 


Glu 








165 









Ser He Cys Phe Asn Pro Thr Arg 

10 15 
Ser Pro Ser Arg Leu Phe Pro Val 
25 30 
Arg Phe Ser Asp Arg Arg Ser Leu 

45 

Arg Leu Ser Pro Leu Cys Val Arg 

Gin Arg Ser Trp Glu Asp Ser Val 

75 80 
Val Glu Phe Tyr Thr Ser Trp Cys 

90 95 
He He Asp Glu He Ala Gly Asp 
105 " HO 

Leu Leu Asn Ala Asp Asn Asp Leu 

125 

Lys Ala Val Pro Val Val Leu Leu 

14 0 

Ser He Met Gly Thr Met Pro Lys 
155 160 
Arg Val Leu Asn Ser 
170 



<210> 62 
<211> 193 
<212> PRT 

<213> Arabidopsis thai i ana 



<400> 62 
Met Ala Ser Leu 
1 

Pro He Ala Ala 

20 

Arg Arg Arg He 
35 

Ser Ser Arg Ser 
50 

Arg Thr Arg He 
65 

Thr Thr Ala Ala 

Gin Thr Lys Val 

100 

Ala Pro Trp Cys 
115 

Leu Ala Lys Asp 
130 

Asp Glu Ser Pro 
145 



Leu Asp Ser Val 
5 

Ser Val Ser Ser 

Ser Pro Ala Arg 

40 

Leu Val Thr Gin 
55 

Ala Arg Gly Gly 
70 

Ala Val Glu Val 
85 

Leu Glu Ser Asp 

Gly Pro Cys Arg 

120 

Phe Ala Gly Lys 
135 

Asn Thr Pro Asn 
150 



Thr Val Thr Arg 
10 

Ser Ser Ala Ala 
25 

Phe Leu Glu Phe 

Ser Ala Ser Leu 

60 

Arg He Ala Cys 
75 

Pro Asn Leu Ser 
90 

Val Pro Val Leu 
105 

Met He His Pro 

Phe Lys Phe Tyr 

14 0 

Arg Tyr Gly He 
155 



Val Phe Ser Leu 
15 

Pro Ser Val Ser 
30 

Arg Gly Leu Lys 
45 

Gly Ala Asn Arg 

Glu Ala Gin Asp 

80 

Asp Ser Glu Trp 
95 

Val Glu Phe Trp 
110 

He Val Asp Gin 
125 

Lys He Asn Thr 

Arg Ser Val Pro 

160 
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Thr Val He He 

Ala Val Pro Arg 

180 

Glu 



Phe Lys Gly Gly 
165 

Glu Thr Leu Glu 



Glu Lys Lys Asp 
170 

Lys Thr He Glu 
185 



Ser He He Gly 
17 5 

Arg Phe Leu Val 
190 



<210> 63. 
<211> 177 
<212> PRT 

<213> Brassica napus 



<400> 63 
Met Ala Ala Phe 
1 

Glu Met Met He 

20 

Met Phe Ser Val 
35 

Ser Lys Asn Ser 
50 

Glu Ala Gin Asp 
65 

Trp Glu Ser Leu 

Trp Ala Pro Trp 

100 

Glu Leu Ala Gin 
115 

Thr Asp Asp Ser 
130 

Pro Thr He Met 
145 

Gly Ala Val Pro 
Gin 



Thr Cys Thr Ser 
5 

Ala Ser Ser Lys 

Gly Gly Leu Arg 

40 

Arg Ala Ser Arg 
55 

Thr Ala Thr Gly 
70 

Val Leu Lys Ala 
85 

Cys Gly Pro Cys 

Gin Tyr Thr Gly 

120 

Pro Ala Thr Pro 
135 

lie Phe Val Lys 
150 

Lys Thr Thr Leu 
165 



Ser Pro Pro He 
10 

Thr Val Ser Leu 
25 

Thr Arg Val Ser 

Leu Arg Arg Gly 

60 

He Pro Met Val 
75 

Asp Glu Pro Val 
90 

Lys Met He Asp 
105 

Lys He Lys Phe 

Gly Lys Tyr Gly 

140 

Gly Glu Lys Lys 
155 

Ala Thr Ser lie 
170 



Ser Leu Arg Ser 
15 

Ser Thr Arg Gin 
30 

Leu Ser Ser Val 
45 

Gly He He Cys 

Asn Asp Ser Thr 

80 

Val Val Asp Phe 
95 

Pro He Val Asn 
110 

Phe Lys Leu Asn 
125 

Val Arg Ser He 

Asp Thr He He 

16 0 

Asp Lys Phe Leu 
175 



<210> 64 
<211> 140 
<212> PRT 

<213> Chlamydomonas reinhardtii 
<400> 64 



Met 


Ala 


Leu 


Val 


Ala 


Arg Arg 


Ala 


Ala 


Val 


Pro 


Ser 


Ala 


Arg 


Ser 


Ser 


1 








5 










10 










15 


Val 


Ala 


Arg 


Pro 


Ala 


Phe 


Ala 


Arg 


Ala 


Ala 


Pro 


Arg 


Arg 


Ser 


Val 


Val 






20 










25 










30 






Arg 


Ala 


Glu 


Ala 


Gly 


Ala 


Val 


Asn 


Asp 


ASp 


Thr 


Phe 


Lys' Asn 


Val 


Val 




35 








40 










45 








Leu 


Glu 
50 


Ser 


Ser 


Val 


Pro 


Val 
55 


Leu 


Val 


Asp 


Phe 


Trp 
60 


Ala 


Pro 


Trp 


Cys 


Gly 


Pro 


Cys 


Arg 


He 


He 


Ala 


Pro 


Val 


Val 


Asp 


Glu 


He 


Ala 


Gly 


Glu 


65 






70 










75 






Glu 




80 


Tyr 


LyB 


Asp 


Lys 


Leu 


Lys 


Cys 


Val 


Lys 


Leu 


Asn 


Thr 


Asp 


Ser 


Pro 


85 










90 










95 




Asn 


Val 


Ala 


Ser 


Glu 


Tyr 


Gly 


He 


Arg 


Ser 


He 


Pro 


Thr 


He 


Met 


Val 








100 








105 










110 






Phe 


Lys 


Gly 


Gly 


Lys 


Lys 


Cys 


Glu 


Thr 


He 


He 


Gly Ala 


Val 


Pro 


Lys, 




115 










120 










125 








Ala 


Thr 
130 


He 


Val 


Gin 


Thr 


Val 
135 


Glu 


Lys 


Tyr 


Leu 


Asn 
14 0 











<210> 65 
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<211> 167 
<212> PRT 
<213> Zea mays 



<400> 65 
Met Ala Met Glu 
1 

Gly Ser Lys Asp 

20 

Arg Ala Leu Ala 
35 

Arg His Val Cys 
50 

Asp Glu Lys Asn 
65 

Leu Val Glu Phe 

Pro Val lie Asp 

100 

Cys Lys Val Asn 
115 

lie Arg Ser lie 
130 

Glu Ser Val lie 
145 

Asp Lys Tyr lie 



Thr Cys Phe Arg 
5 

Arg Leu Leu Val 

Pro Leu Ser Val 

40 

Gin Ser Lys Asn 
55' 

Trp Asp Gly Leu 
70 

Trp Ala Pro Trp 
85 

Glu Leu Ala Lys 

Thr Asp Asp Ser 

120 

Pro Thr Val Leu 
135 

Gly Ala Val Pro 
150 

Gly Ser Ser 
165 



Ala Trp- Ala Leu 
10 

Gly Asn Leu Val 
25 

Gly Arg Val Ala 

Ala Val Asp Glu 

60 

Val Met Ala Cys 
75 

Cys Gly Pro Cys 
90 

Asp Tyr Ala Gly 
105 

Pro Asn Val Ala 

lie Phe Lys Gly 

140 

Lys Ser Thr Leu 
155 



His Ala Pro Ala 
15 

Leu Pro Ser Lys 
30 

Thr Arg Arg Pro 
45 

Val Val Val Ala 

Glu Thr Pro Val 

80 

Arg Met lie Ala 
95 

Lys lie Thr Cys 
110 

Ser Thr Tyr Gly 
125 

Gly Glu Lys Lys 

Thr Thr Leu lie 

160 



<210> 66 

<211> 172 

<212> PRT 

<213> Oryza sativa 



<400> 66 
Met Ala Leu Glu 
1 

Gin Pro Pro Ser 

20 

Ala Gly Ser Ser 
35 

Pro Leu Arg Pro 
50 

Asp Glu Val Val 
65 

Gly Ser Glu Ala 

Pro Cys Arg Met 

100 

Val Gly Lys He 
115 

He Ala Thr Asn 
130 

Lys Asn Gly Glu 
14 5 

Thr Leu Ala Thr 



Thr Cys Phe Arg 
5 

Ser Gly Gly Ser 

Gin Ser Lys Pro 

40 

Ala Ser Arg Phe 
55 

Val Ala Asp Glu 
70 

Pro Val Leu Val 
85 

He Ala Pro Val 

Lys Cys Cys Lys 

12 0 

Tyr Gly He Arg 
135 

Lys Lys Glu Ser 
150 

He He Asp Lys 
165 



Ala Trp Ala Thr 
10 

Arg Asp Arg Leu 
25 

Arg Leu Ser Val 

Ala Cys Gin Cys 

60 

Lys Asn Trp Asp 
75 

Glu Phe Trp Ala 
90 

He Asp Glu Leu 
105 

Val Asn Thr Asp 

Ser He Pro Thr 

140 

Val He Gly Ala 
155 

Tyr Val Ser Ser 
170 



Leu His Ala Pro 
15 

Leu Leu Ser Gly 
30 

Ala Ser Pro Ser 
45 

Ser Asn Val Val 

Ser Met Val Leu 

80 

Pro Trp Cys Gly 
95 

Ala Lys Glu Tyr 
110 

Asp Ser Pro Asn 
125 

Val Leu Met Phe 

Val Pro Lys Thr 

160 



<210> 67 

<211> 172 

<212> PRT 

<213> Pisum sativum 



<400> 67 

Met Ala Leu Glu Ser Leu Phe Lys Ser He His Thr Lys Thr Ser Leu 
1 5 10 15 
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Ser 


Ser 


Ser 


He Val 


Phe 


He 


Phe 


Lys 


Gly 


Lys 


Ala 


Cys 


Leu 


Leu 


Thr 








20 








25 










30 


Phe 


Thr 


Ser 


Lys 


Ser 


Arg He 


Gin 


Glu 


Ser 


Phe 


Ala 


Glu 


Leu 


Asn 


Ser 




35 






40 










45 






Glu 


Ser 


Leu 


Val 


Leu Leu 


He 


Glu 


Asn 


His 


Val 


Leu 


Leu 


His 


Ala 


Arg 




50 








55 










60 






Glu 




Ala 


Val 


Asn 


Glu Val 


Gin 


Val 


Val 


Asn 


Asp 


Ser 


Ser 


Trp 


Asp 


Leu 


65 








70 










75 






Ala 




80 


Val 


He 


Gly 


Ser Glu 


Thr 


Pro 


Val 


Leu 


Val 


Asp 


Phe 


Trp 


Pro 


Trp 






85 










90 










95 




Cys 


Gly 


Pro 


Cys Arg 


Met 


He 


Ala 


Pro 


He 


He 


Asp 


Glu 


Leu 


Ala 


Lys 




100 








105 










110 


Glu 




Glu 


Tyr 


Ala 


Gly Lys 


He 


Lys 


Cys 


Tyr 


Lys 


Leu 


Asn 


Thr 


Asp 


Ser 




115 






12 0 










125 




Val 




Pro 


Asn 


Thr 


Ala Thr 


Lys 


Tyr 


Gly 


He 


Arg 


Ser 


He 


Pro 


Thr 


Leu 




130 








135 










140 






Val 




Phe 


Phe 


Lys 


Asn Gly 


Glu 


Arg 


Lys 


Asp 


Ser 


Val 


He 


Gly Ala 


Pro 


145 




150 










155 










160 


Lys 


Ala 


Thr 


Leu Ser 


Glu 


Lys 


Val 


Glu 


Lys 


Tyr 


He 














165 










170 















<210> 68 
<211> 181 
<212> PRT 

<213> Spinacia oleracea 
<400> 68 

Met Ala He Glu Asn Cys Leu Gin Leu Ser Thr Ser Ala Ser Val Gly 

15 10 15 

Thr Val Ala Val Lys Ser His Val His His Leu Gin Pro Ser Ser Lys 

20 * 25 30 

Val Asn Val Pro Thr Phe Arg Gly Leu Lys Arg Ser Phe Pro Ala Leu 

35 40 45 

Ser Ser Ser Val Ser Ser Ser Ser Pro Arg Gin Phe Arg Tyr Ser Ser 

50 55 60 

Val Val Cys Lys Ala Ser Glu Ala Val Lys Glu Val Gin Asp Val Asn 
65 70 75 80 

Asp Ser Ser Trp Lys Glu Phe Val Leu Glu Ser Glu Val Pro Val Met 

85 90 95 

Val Asp Phe Trp Ala Pro Trp Cys Gly Pro Cys Lys Leu He Ala Pro 

100 105 HO 

Val He Asp Glu Leu Ala Lys Glu Tyr Ser Gly Lys He Ala Val Tyr 

115 120 125 

Lys Leu Asn Thr Asp Glu Ala Pro Gly He Ala Thr Gin Tyr Asn He 

130 135 140 

Arg Ser He Pro Thr Val Leu Phe Phe Lys Asn Gly Glu Arg Lys Glu 
145 150 155 160 

Ser He He Gly Ala Val Pro Lys Ser Thr Leu Thr Asp Ser He Glu 

165 170 175 

Lys Tyr Leu Ser Pro 

180 



<210> 69 
<211> 175 
<212> PRT 

<213> Tri.ticum aestivum 
<400> 69 

Met Ala Leu Glu Thr Cys Leu Arg Gly Trp Ala Leu Tyr Ala Pro Gin 

15 10 15 

Ala Gly He Arg Glu Arg Leu Ser Ser Gly Ser Tyr Ala Pro Ser Arg 

20 25 30 

Pro Arg Thr Ala Ala Pro Ala Val Val Ser Pro Ser Pro Tyr Lys Ser 

35 40 45 

Ala Leu Val Ala Ala Arg Arg Pro Ser Arg Phe Val Cys Lys Cys Lys 
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50 










55 






Asn 


Val 


Val 


Asp 


Glu 


Val 


lie 


Val 


Ala 


65 








70 








Met 


Val 


lie 


Ala 


Cys 
85 


Glu 


Ser 


Pro 


Val 


Trp 


Cys 


Gly 


Pro 


Cys 


Arg 


Met 


He 


Ala 








100 










105 


Lys 


Asp 


Tyr Val 


Gly Lys 


He 


Lys 


Cys 






115 










120 




Cys 


Pro 


Asn 


lie 


Ala 


Ser 


Thr 


Tyr Gly 


13 0 










135 






Leu 


Met 


Phe 


Lys 


Asp 


Gly Glu 


Lys 


Lys 


145 










150 








Pro 


Lys 


Thr 


Thr 


Leu 


Cys 


Thr 


He 


He 








165 











<210> 70 

<211> 106 

<212> PRT 

<213> Ana cyst is nidulans 



<400> 70 



Ser 


Val 


Ala 


Ala 


Ala 


Val 


Thr 


Asp Ala 


1 
Glu 


Ser 


Ser 


He 


5 

Pro 


Val 


Leu 


Val 


Asp 








20 










25 


Pro 


Cys 


Arg 


Met 


Val 


Ala 


Pro 


Val 


Val 




35 










40 




Ser 


Asp 


Gin 


Val 


Lys 


Val 


Val 


Lys 


val 




50 










55 






Val 


Ala 


Ser 


Gin 


Tyr 


Gly 


He 


Arg 


Ser 


65 










70 








Lys 


Asp 


Gly 


Gin 


Arg 
85 


Val 


Asp 


Thr 


Val 


Thr 


Leu 


Ala 


Asn 


Thr 


Leu 


Asp 


Lys 


His 








100 










105 



<210> 71 

<211> 107 

<212> PRT 

<213> Cyanidium caldarium 

<400> 71 



Met 


Pro 


Ser 


Pro 


He 


Gin 


Val 


Thr 


Asp 


1 
Val 


Asn 


Ser 


Glu 


5 
Lys 


Leu 


Val 


Leu 


Val 








20 








25 


Gly 


Pro 


Cys 


Arg 


Met 


lie 


Ser 


Pro 


Val 






35 










40 




Tyr 


Val 


Glu 


Gin 


Val 


Lys 


He 


Val 


Lys 




50 










55 






Ser 


He 


Ser 


Ala 


Glu 


Tyr 


Gly 


He 


Arg 


65 










70 








Phe 


Lys 


Asp 


Gly 


Lys 
85 


Arg 


Val 


Asp 


Thr 


Ser 


Thr 


Leu 


Thr 


Asn 


Ala 


Leu 


Lys 


Lys 








100 










105 



<210> 72 

<211> 102 

<212> PRT 

<213> Cyanidioschyzon merolae 

<400> 72 



PCT/US0 1/50240 



60 



Asp 


Glu 


Lys 


Asn 


Trp 


Asp 


Asn 




75 










80 


Leu 


Val 


Glu 


Phe 


Trp 


Ala 


Pro 


90 










95 




Pro 


Val 


He 


Asp 


Glu 


Leu 


Ala 








110 






Cys 


Lys 


Val 


Asn Thr Asp 


Asp 








125 








He 


Arg 


Ser 


He 


Pro 


Thr 


Val 






140 










Glu 


Ser 


Val 


He 


Gly 


Ala 


Val 




155 










160 


Asp 


Lys 


Tyr 


He 


Gly 


Ser 




170 










175 





Thr 


Phe 


Lys 


Gin 


Glu 


Val 


Leu 


10 








15 




Phe 


Trp 


Ala 


Pro 


Trp 


Cys 


Gly 










30 






Asp 


Glu 


He 


Ala 


Gin 


Gin 


Tyr 






45 








Asn 


Thr 


Asp 


Glu 


Asn 


Pro 


Ser 






60 










He 


Pro 


Thr 


Leu 


Met 


He 


Phe 




75 










80 


Val 


Gly 


Ala 


Val 


Pro 


Lys 


Thr 


90 










95 




Leu 















Phe 


Ser 


Phe 


Glu 


Lys 


Glu 


Val 


10 










15 




Asp 


Phe 


Trp 


Ala 


Pro 


Trp 


Cys 










30 






He 


Asp 


Glu 


Leu 


Ala 


Gin 


Glu 








45 








He 


Asn 


Thr 


Asp 


Glu 


Asn 


Pro 






60 










Ser 


He 


Pro 


Thr 


Leu 


Met 


Leu 




75 










80 


Val 


He 


Gly Ala Val 


Pro 


Lys 


90 










95 




Tyr 


Leu 
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Met 


Leu 


His 


lie 


1 








Glu 


Lys 




V A -J- 








20 


Arg 


Met 


lie 


Gly 






35 




Lys 


Val 


Val 


Gin 


50 






Tyr Gly 


lie 


Arg 


65 








Arg 


Val 


Asp 


Thr 


Thr 


lie 


Asn 


Lye 








100 



Asp Glu Leu Thr 
5 

Leu Val Asp Phe 

Pro lie Leu Glu 

40 

Val Asn Thr Asp 
55 

Ser lie Pro Thr 
70 

Val lie Gly Ala 
85 

Tyr Leu 



Phe Glu Asn Glu 
10 

Trp Ala Pro Trp 

25 

Glu lie Ala Lys 

Glu Asn Pro Asn 

60 

Leu Met Leu Phe 
75 

Val Pro Lys Ser 
90 



Val Leu Gin Ser 
15 

Cys Gly Pro Cys 
3 0 

Glu Phe Asn Leu 
45 

Leu Ala Thr Phe 

Lys Lys Gly Gin 

80 

He Leu He His 
95 



<210> 73 
<211> 109 
<212> PRT 

<213> Griff ithsia pacifica * 

<400> 73 , 

Met Ser He Ser Gin Val He ABp Thr Ser Phe His Glu Glu Val He 

15 10 15 

Asn Ser Arg Gin Pro Val Leu Val Asp Phe Trp Ala Pro Trp Cys Gly 

20 25 30 

Pro Cys Arg Met He Ala Ser Thr lie Asp Glu He Ala His Asp Tyr 

35 40 45 

Lys Asp Lys Leu Lys Val Val Lys Val Asn Thr Asp Gin Asn Pro Thr 

50 55 60 

He Ala Thr Glu Tyr Gly He Arg Ser He Pro Thr Val Met He Phe 
65 70 75 BO 

He Asn Gly Lys Lys Val Asp Thr Val Val Gly Ala Val Pro Lys Leu 

85 90 95 

Thr Leu Leu Asn Thr Leu Gin Lys His Leu Lys Ser Thr 

100 105 



<210> 74 
<211> 107 
<212> PRT 

<213> Porphyra yezoensis 
<400> 74 

Met Ser Val Ser Gin Val Thr Asp Ala Ser Phe Lys Gin Glu Val He 

15 10 15 

Asn Asn Asn Leu Pro Val Leu Val Asp Phe Trp Ala Pro Trp Cys Gly 

20 25 30 

Pro Cys Arg Met Val Ser Pro Val Val Asp Glu He Ala Glu Glu Tyr 

35 40 45 

Glu Ser Ser He Lys Val Val Lys He Asn Thr Asp Asp Asn Pro Thr 

50 55 60 

He Ala Ala Glu Tyr Gly He Arg Ser He Pro Thr Leu Met He Phe 
65 * 70 75 80 

Lys Ala Gly Glu Arg Val Asp Thr Val He Gly Ala Val Pro Lys Ser 

85 " 90 95 

Thr Leu Ala Ser Thr Leu Asn Lys Tyr He Ser 

100 105 



<210> 75 
<211> 107 
<212> PRT 

<213> Porphyra purpurea 

<400> 75 , ^ n 

Met Ser Val Ser Gin Val Thr Asp Ala Ser Phe Lys Gin Glu Val He 
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1 

1 








c 

_> 










Asn 


Asn 


Asp 


Leu 


jrro 


vai 


Leu 


Val 








20 










25 


Pro 


Cys 


Arg 


Met 


Val 


Ser 


Pro 


Val 


Val 




35 










40 




Glu 


Ser 


Ser 


lie 


Lys 


Val 


Val 


Lys 


He 




50 










55 






He 


Ala 


Ala 


Glu 


Tyr 


Gly 


He 


Arg 


Ser 


65 










70 








Lys 


Ser 


Gly 


Glu 


Arg 


Val 


Asp 


Thr 


Val 








85 










Thr 


Leu 


Glu 


Ser 


Thr 


Leu 


Asn Lys 


Tyr 








100 










105 



PCTYUS01/50240 



10 15 
Phe Trp Ala Pro Trp Cys Gly 

30 

Asp Ala He Ala Glu Glu Tyr 

45 

Asn Thr Asp Asp Asn Pro Thr 
50 

He Pro Thr Leu Met He Phe 

75 80 
He Gly Ala Val Pro Lys Ser 
90 95 
He Ser 



<210> 76 
<211> 114 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 76 



Met 


Ala 


Ser 


Glu 


Glu 


Gly 


Gin 


Val 


1 








5 








Trp 


Asn 


Glu 


Gin 


Leu 


Gin 


Lys 


Ala 






20 










Val 


Asp 


Phe 


Thr 


Ala 


Ser 


Trp 


Cys 




35 










40 


Phe 


Phe 
50 


Ala 


Asp 


Leu 


Ala 


Lys 
55 


Lys 


Val 


Asp 


Thr 


Asp 


Glu 


Leu 


Lys 


Ser 


65 










70 






Ala 


Met 


Pro 


Thr 


Phe 
85 


Met 


Phe 


Leu 


Val 


Val 


Gly 


Ala 
100 


Lys 


Lys 


Asp 


Glu 


Leu 


Ala 















He Ala Cys His Thr Val Glu Thr 

10 15 
Asn Glu Ser Lys Thr Leu Val Val 
25 ~ 30 

Gly Pro Cys Arg Phe He Ala Pro 

45 

Leu Pro Asn Val Leu Phe Leu Lys 

60 

Val Ala Ser Asp Trp Ala He Gin 

75 80 
Lys Glu Gly Lys He Leu Asp Lys 

90 95 
Leu Gin Ser Thr He Ala Lys His 
105 HO 



<210> 77 




<211> 110 




<212> PRT 




<213> Anabaena 


<400> 77 




Ser Lys Gly 


Val 


1 




Leu Lys Ala 


Glu 




20 


Gly Pro Cys 


Gin 


35 




Tyr Ser Asp 


Arg 


50 




Thr Thr Val 


Lys 


65 




Val Lys Gly 


Glu 


Asp Lys Leu 


Leu 




100 



He Thr He Thr 
5 

Gin Pro Val Leu 

Leu Met Ser Pro 

40 

Leu Lys Val Val 
55 

Lys Tyr Lys Val 
70 

Gin He Leu Asp 
85 

Ser Phe Leu Asp 



Asp Ala Glu Phe 
10 

Val Tyr Phe Trp 
25 

Leu He Asn Leu 

Lys Leu Glu He 

60 

Glu Gly Val Pro 
75 

Ser Thr Glu Gly 
90 

Thr His Leu Asn 
105 



Glu Ser Glu Val 
15 

Ala Ser Trp Cys 
30 

Ala Ala Asn Thr 
45 

Asp Pro Asn Pro 

Ala Leu Arg Leu 

80 

Val He Ser Lys 
95 

Asn Asn 
110 



<210> 78 
<211> 123 
<212> PRT 

<213> Brassica napus 
<400> 78 
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Met 


Ala 


Ala 


Thir 


Ala 


Glu 


Val 


He 


Pro 


Ala 


Gly 


Glu 


Val 


He 


Ala 


Cys 


1 








5 










10 










15 




His 


Thr 


Val 


Glu 


Asp 


Trp 


Asn 


Asn 


Lys 


Leu 


Lys 


Ala 


Ala 


Lys 


Glu 


Ser 








20 








25 










30 






Asn 


Lys 


Leu 


lie 


Val 


lie 


Asp 


Phe 


Thr 


Ala 


Val 


Trp 


Cys 


Pro 


Pro 


Cys 




35 








40 










45 


His 






Arg 


Phe 


lie 


Ala 


Pro 


lie 


Phe 


Val 


Glu 


Leu 


Ala 


Lys 


Lys 


Leu 


Asp 


50 










55 










60 








Gin 


Val 


Val 


Phe 


Phe 


Lys 


Val 


Asp 


Val 


Asp 


Glu 


Leu 


Ala 


Thr 


Val 


Ala 


65 








70 








75 








Gly 


80 


Glu 


Phe 


Asp 


Val 


Gin 


Ala 


Met 


Pro 


Thr 


Phe 


Val 


Tyr 


Met 


Lys 


Glu 








85 










90 










95 


Ala 


Glu 


Lys 


Leu 


Asp 


Lys 


Val 


Val 


Gly Ala 


Ala 


Lys 


Glu 


Glu 


He 


Glu 






100 








105 










110 






Lys 


Leu 


Leu 


Lys 


His 


Ser 


Gin 


Val 


Ala 


Ala 


Ala 




r 










115 










120 



















<210> 79 
<211> 126 
<212> PRT 

<213> Nicotiana tabacum 



<400> 79 
























Phe 


Gly 


Met 


Ala 


Ala 


Asn Asp 


Ala 


Thr 


Ser 


Ser 


Glu 


Glu Gly 


Gin 


Val 


1 








5 










10 










15 




Cys 


His 


Lys 


Val 


Glu 


Glu 


Trp 


Asn 


Glu 


Tyr 


Phe 


Lys 


Lys 


Gly 


Val 


Glu 




20 










25 










30 






Thr 


Lys 


Lys 


Leu 


Val 


Val 


Val 


Asp 


Phe 


Thr 


Ala 


Ser 


Trp 


Cys 


Gly 


Pro 




35 










40 










45 








Cys 


Arg 


Phe 


He 


Ala 


Pro 


He 


Leu Ala Asp 


He 


Ala 


Lys 


Lys 


Met 


Pro 


50 










55 










60 






Val 




His 


Val 


He 


Phe 


Leu 


Lys 


Val 


Asp 


Val 


Asp 


Glu 


Leu 


Lys 


Thr 


Ser 


65 










70 










75 










80 


Ala 


Glu 


Trp 


Ser 


Val 


Glu 


Ala 


Met 


Pro 


Thr 


Phe 


Val 


Phe 


He 


Lys 


Asp 








85 










90 










95 


Gin 


Gly 


Lys 


Glu 


Val 


Asp 


Arg 


Val 


Val 


Gly 


Ala 


Lys 


Lys 


Glu 


Glu 


Leu 




100 










105 










110 






Gin 


Thr 


He 


Val 


Lys 


His 


Ala 


Ala 


Pro 


Ala 


Thr 


Val 


Thr 


Ala 










115 








120 










125 









<210> 80 
<211> 133 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 80 




















Glu 




Ala 


Thr 


Met 


Gly 


Gly 


Ala 


Leu 


Ser 


Thr 


Val 


Phe 


Gly 


Ser Gly 


Asp 


1 




5 










10 










15 




Ala 


Ala 


Gly 


Thr 


Glu 


Ser 


Glu 


Pro 


Ser 


Arg 


Val 


Leu 


Lys 


Phe 


Ser 


Ser 






20 










25 










30 






Ser 


Ala 


Arg 


Trp 


Gin 


Leu 


His 


Phe 


Asn 


Glu 


He 


Lys 


Glu 


Ser 


Asn 


Lys 






35 








40 










45 








Leu 


Leu 


Val 


Val 


Asp 


Phe 


Ser 


Ala 


Ser 


Trp 


Cys 


Gly 


Pro 


Cys 


Arg 


Met 




50 








55 










60 






Val 




He 


Glu 


Pro 


Ala 


He 


His 


Ala 


Met 


Ala 


Asp 


Lys 


Phe 


Asn 


Asp 


Asp 


65 










70 










75 








Glu 


80 


Phe 


Val 


Lys 


Leu 


Asp 


Val 


Asp 


Glu 


Leu 


Pro 


Asp 


Val 


Ala 


Lys 


Phe 








85 










90 










95 


Glu 


Asn 


Val 


Thr 


Ala 


Met 


Pro 


Thr 


Phe 


Val 


Leu 


Val 


Lys 


Arg 


Gly 


Lys 






100 










105 










110 




Val 


He 


Glu 


Arg 
115 


He 


He 


Gly 


Ala 


Lys 
120 


Lys 


Asp 


Glu 


Leu 


Glu 
125 


Lys 


Lys 


Ser 


Lys 
130 


Leu 


Arg 


Ala 
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<210> 81 
<211> 119 
<212> PRT 

<213> Brassica napus 



<400> 81 
Met Ala Ala Glu 
1 

Trp Ala Val Gin 

20 

lie Asp Phe Thr 
35 

Val Phe Ala Asp 
50 

Lys Val Asp Val 
65 

Glu Ala Met Pro 

Lys Val Val Gly 

100 

His Thr Gly Val 
115 



Glu Gly Gin Val 
5 

Leu Asp Thr Ala 

Ala Ser Trp Cys 

40 

Leu Ala Lys Lys 
55 

Asp Glu Leu Gin 
70 

Thr Phe Val Leu 
85 

Ala Arg Lys Glu 
Ala Thr Ala 



lie Gly Cys His 
10 

Lys Gin Ser Asn 
25 

Pro Pro Cys Arg 

Phe Met Ser Ser 

60 

Asn Val Ala Gin 
75 

lie Lys Asp Gly 
90 

Asp Leu His Ala 
105 



Glu He Asp Val 
15 

Lys Leu He Val 
3 0 

Met He Ala Pro 
45 

Ala He Phe Phe 

Glu Phe Gly Val 

80 

Asn Val Val Asp 
95 

Thr He Ala Lys 
110 



<210> 82 
<211> 118 
<212> PRT 

<213> Nicotiana tabacum 



<400> 82 




Met 


Ala 


Glu 


Glu 


1 








Asn 


Glu 


His 


Leu 








20 


Asp 


Phe 


Thr 


Ala 






35 




Tyr 


Ala 


Glu 


Leu 




50 






Asp 


Val 


Asp 


Glu 


65 








Met 


Pro 


Thr 


Phe 


Val 


Gly 


Ala 


Lys 








100 


Ser 


Ser 


Thr 


Ser 






115 





Gly Gin Val He 
5 

Gin Lys Gly He 

Ser Trp Cys Gly 

40 

Ala Lys Lys Met 
55 

Leu Lys Ser Val 

70 • 

Met Phe Leu Lys 
85 

Lys Asp Glu Leu 
Thr Ala 



Gly Val His Thr 
10 

Asp Asp Lys Lys 
25 

Pro Cys Lys Phe 

Pro Thr Val Thr 

60 

Ala Thr Asp Trp 
75 

Glu Gly Lys He 
90 

Gin Gin Thr He 
105 



Val Asp Ala Trp 
15 

Leu He Val Val 
30 

He Ala Ser Phe 
45 

Phe Leu Lys Val 

Ala Val Glu Ala 

80 

Val Asp Lys Val 
95 

Ala Lys His He 
110 



<210> 83 
<211> 118 
<212> PRT 

<213> Arabidopsis tha liana 



<400> 83 




Met 


Ala 


Ala 


Glu 


1 








Thr 


Glu 


Lys 


Leu 






20 


Asp 


Phe 


Thr 


Ala 




35 




Phe 


Ala 


Asp 


Leu 




50 






Asp 


Val 


Asp 


Glu 


65 








Met 


Pro 


Thr 


Phe 


Val 


Gly Ala 


Ala 



Gly Glu Val He 
5 

Lys Ala Ala Asn 

Thr Trp Cys Pro 

40 

Ala Lys Lys His 
55 

Leu Asn Thr Val 
70 

He Phe Met Lys 
85 

Lys Glu Glu He 



Ala Cys His Thr 
10 

Glu Ser Lys Lys 
25 

Pro Cys Arg Phe 

Leu Asp Val Val 

60 

Ala Glu Glu Phe 
75 

Glu Gly Glu He 
90 

He Ala Asn Leu 



Val Glu Asp Trp 
15 

Leu He Val He 
30 

He Ala Pro Val 
45 

Phe Phe Lys Val 

Lys Val Gin Ala 

80 

Lys Glu Thr Val 
95 

Glu Lys His Lys 
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100 105 110 

Thr Val Val Ala Ala Ala 
115 



<210> 84 
<211> 125 
<212> PRT 

<213> Arabidopsis thai i ana 
<400> 84 



Met 


Ala 


Ala 


Glu 


Glu 


Gly 


Gin 


Val 


He 


Gly 


Cys 


His 


Thr 


Asn 


Asp 


Val 


1 








5 








10 










15 


Val 


Trp 


Thr 


Val 


Gin 


Leu 


Asp 


Lys 


Ala 


Lys 


Glu 


Ser 


Asn 


Lys 


Leu 


He 






20 










25 










30 






lie 


Asp 


Phe 


Thr 


Ala 


Ser 


Trp 


Cys 


Pro 


Pro 


Cys 


Arg 


Met 


He 


Ala 


Pro 




35 










40 










45 




Phe 


Phe 


He 


Phe 


Asn 


Asp 


Leu 


Ala 


Lys 


Lys 


Phe 


Met 


Ser 


Ser 


Ala 


He 




50 








55 










60 










Lys 


Val 


Asp 


Val 


Asp 


Glu 


Leu 


Gin 


Ser 


Val 


Ala 


Lys 


Glu 


Phe 


Gly Val 


65 






70 










75 








Val 


80 


Glu 


Ala 


Met 


Pro 


Thr 


Phe 


Val 


Phe 


He 


Lys 


Ala 


Gly Glu 


Val 


Asp 










85 










90 










95 




Lys 


Leu 


Val 


Gly 


Ala 


Asn 


Lys 


Glu 


Asp 


Leu 


Gin 


Ala 


Lys 


He 


Val 


Lys 






100 










105 










110 






His 


Thr 


Gly 
115 


Val 


Thr 


Thr 


Val 


Val 
120 


Asn 


Gin 


Phe 


Glu 


Ala 
125 









<210> 85 
<211> 118 
<212> PRT 

<213> Arabidopsis thai i ana 
<400> 85 



Met 


Ala 


Gly 


Glu 


Gly 


Glu 


Val 


He 


Ala 


Cys 


His 


Thr 


Leu 


Glu 


Val 


Trp 


1 






5 










10 










15 




Asn 


Glu 


Lys 


Val 


Lys 


Asp 


Ala 


Asn 


Glu 


Ser 


Lys 


Lys 


Leu 


He 


Val 


He 






20 








25 










30 






Asp 


Phe 


Thr 


Ala 


Ser Trp 


Cys 


Pro 


Pro 


Cys 


Arg 


Phe 


He 


Ala 


Pro 


Val 




35 










40 










45 






He 


Phe 


Ala 


Glu 


Met 


Ala 


Lys 


Lys 


Phe 


Thr 


Asn 


Val 


Val 


Phe 


Phe 


Lys 




50 










55 










60 






Glu 


Ala 


Asp 


Val 


Asp 


Glu 


Leu 


Gin 


Ala 


Val 


Ala 


Gin 


Glu 


Phe 


Lys 


Val 


65 








70 










75 










80 


Met 


Pro 


Thr 


Phe 


Val 


Phe 


Met 


Lys 


Glu 


Gly Asn 


He 


He 


Asp 


Arg 


Val 










85 








90 










95 


Gly 


Val 


Gly 


Ala 


Ala 


Lys 


Asp 


Glu 


He 


Asn 


Glu 


Lys 


Leu 


Met 


Lys 


His 






100 






105 










110 






Gly 


Leu 


Val 


Ala 


Ser 


Ala 
























115 





























<210> 86 
<211> 123 
<212> PRT 

<213> Brassica rapa 
<400> 86 

Met Ala Ala Thr Ala Glu Leu He 

1 5 
His Thr Val Glu Asp Trp Asn Asn 

20 

Asn Lys Leu He Val He Asp Phe 

35 4 0 

Arg Phe He Ala Pro He Phe Val 
50 55 



Pro Ala Gly Glu Val He Ala Cys 

10 " 15 

Lys Leu Lys Ala Ala Lys Glu Ser 
25 30 
Thr Ala Val Trp Cys Pro Pro Cys 

45 

Glu Leu Ala Lys Lys His Leu Asp 

60 
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Val Val Phe Phe 
65 

Glu Phe Asp Val 

Glu Lys Leu Asp 

100 

Lys Leu Leu Lys 
115 



Lys Val Asp Val 
70 

Gin Ala Met Pro 
85 

Lys Val Val Gly 

His Ser Gin Val 

120 



Asp Glu Leu Ala 
75 

Thr Phe Val Tyr 
90 

Ala Ala Lys Glu 
105 

Ala Ala Ala 



Thr Val Ala Lys 

80 

Met Lys Gly Glu 
95 

Glu lie Glu Ala 
110 



<210> B7 

<211> 112 

<212> PRT 

<213> Chlamydomonas reinhardtii 



<400> 87 
Gly Gly Ser Val 
1 

Leu Ala Lys Gly 

20 

Ala Thr Trp Cys 
35 

Leu Ser Asn Asp 
50 

Asp Ala Val Ala 
65 

Thr Phe His Val 

Ala Ser Gin Asp 

100 



lie 


Val 


He 


Asp 


5 








Lys 


Glu 


Glu 


His 


Gly 


Pro 


Cys 


Lys 








40 


Tyr 


Ala 


Gly 


Lys 






55 




Ala 


Val 


Ala 


Glu 




70 






Tyr 


Lys 


Asp 


Gly 


85 








Lys 


Leu 


Lys 


Ala 



Ser Lys Ala Ala 
10 

Lys Pro He Val 
25 

Met He Ala Pro 

Val He Phe Leu 

60 

Ala Ala Gly He 
75 

Val Lys Ala Asp 
90 

Leu Val Ala Lys 
105 



Trp Asp Ala Gin 
15 

Val Asp Phe Thr 
30 

Leu Phe Glu Thr 
45 

Lys Val Asp Val 

Thr Ala Met Pro 

80 

Asp Leu Val Gly 
95 

His Ala Ala Ala 
110 



<210> 88 

<211> 116 

<212> PRT 

<213> Fagopyrum esculent urn 



<400> 88 




Met 


Ala 


Glu 


Glu 


1 








Asn 


Glu 


Lys 


Phe 








20 


Asp 


Phe 


Thr 


Ala 






35 




Val 


Ser 


Glu 


Leu 




50 






Asp 


Val 


Asp 


Asp 


65 








Met 


Pro 


Ser 


Phe 


Val 


Gly Ala 


Arg 








100 


Pro 


He 


Thr 


Ala 






115 





Ala Gin Val He 
5 

Gin Lys Ala Lys 

Ser Trp Cys Gly 

40 

Ala Lys Lys Phe 

. 55 

Leu Lys Asp Val 
70 

Val He Leu Lys 
85 

Lys Asp Glu Leu 



Ala Cys His Thr 
10 

Asp Ser Gly Lys 
25 

Pro Cys Arg Val 

Pro His Val Ala 

60 

Ala Glu Glu Tyr 
75 

Glu Gly Gin Glu 
90 

Leu His Lys He 
105 



Val Gin Glu Trp 
15 

Leu He Val He 
30 

He Thr Pro Tyr 
45 

Phe Phe Lys Val 

Lys Val Glu Ala 

80 

Val Glu Arg He 
95 

Ala Val His Ala 
110 



<210> 89 
<211> 122 
<212> PRT 

<213> Oryza sativa 
<400> 89 

Met Ala Ala Glu Glu Gly Val Val 

1 5 
Phe Asp Ala Gin Met Thr Lys Ala 

20 

He Asp Phe Thr Ala Ser Trp Cys 



He Ala Cys His Asn Lys Asp Glu 

10 15 
Lys Glu Ala Gly Lys Val Val He 
25 30 
Gly Pro Cys Arg Phe He Ala Pro 
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35 

Val Phe Ala Glu 
50 

Val Asp Val Asp 
65 

Ala Met Pro Thr 

Val Val Gly Ala 

100 

Val Gly Ala Thr 
115 



40 

Tyr Ala Lys Lys 
55 

Glu Leu Lys Glu 
70 

Phe Leu Phe He 
85 

Arg Lys Asp Asp 

Ala Ala Ser Ala 

120 



Phe Pro Gly Ala 

60 

Val Ala Glu Lys 
75 

Lys Asp Gly Ala 
90 

Leu Gin Asn Thr 
105 

Ser Ala 



45 

Val Phe Leu Lys 

Tyr Asn Val Glu 

80 

Glu Ala Asp Lys 
95 

He Val Lys His 
110 



<210> 90 
<211> 125 
<212> PRT 

<213> Picea mariana 



<400> 90 
Met Ala Glu Gly 
1 

Ser LyB Leu Gin 

20 

Phe Thr Ala Thr 
35 

Val Glu Leu Ser 
50 

Val Asp Glu Leu 
65 

Pro Thr Phe He 
Gly Ala Lys Lys 

100 

Ala Ala Thr Thr 
115 



Asn Val Phe Ala 
5 

Glu Ala He Asp 

Trp Cys Gly Pro 

40 

Lys Lys Phe Pro 
55 

Arg Asp Val Ala 
70 

Phe He Lys Asp 
85 

Asp Asp Leu Glu 

Thr Glu Ala Thr 

120 



Cys His Ser Thr 
10 

Thr Lys Arg Leu 
25 

Cys Arg Val He 

Glu He Phe Phe 

60 

Gin Glu Trp Asp 
75 

Gly Lys Ala Val 
90 

Arg Lys Val Ala 
105 

Leu Pro Ala Gin 



Glu Gly Trp Arg 
15 

Val Ala Val Asp 
30 

Gly Pro Val Phe 
45 

Leu Lys Val Asp 

Val Glu Ala Met 

80 

Asp Lys Val Val 
95 

Ala Leu Ala Ala 
110 

Ala 
125 



<210> 91 
<211> 118 
<212> PRT 

<213> Ricinus communis 



<400> 91 




Met 


Ala 


Ala 


Glu 


1 








Trp 


Asn 


Glu 


Gin 






20 


Val 


Asp 


Phe 


Thr 






35 




Phe 


Leu 


Ala 


Glu 




50 






Val 


Asp 


Val 


Asp 


65 








Ser 


Met 


Pro 


Thr 


Val 


Val 


Gly 


Ala 








100 


Met 


Ala 


Thr 


Ala 






115 





Glu Gly Gin Val 
5 

Leu Gin Lys Gly 

Ala Ser Trp Cys 

40 

Leu Ala Lys Lys 
55 

Glu Leu Lys Thr 
70 

Phe Met Phe Leu 
85 

Lys Lys Asp Glu 
Ser Thr 



He Gly Cys His 
10 

Asn Asp Thr Lys 
25 

Gly Pro Cys Arg 

Leu Pro Asn Val 

60 

Val Ala His Glu 
75 

Lys Glu Gly Lys 
90 

Leu Gin Gin Thr 
105 



Thr Val Glu Ala 
15 

Gly Leu He Val 
30 

Phe He Ala Pro 
45 

Thr Phe Leu Lys 

Trp Ala Val Glu 

80 

He Met Asp Lys 
95 

He Ala Lys His 
110 



<210> 92 
<211> 126 
<212> PRT 

<213> triticum aestivum 
<400> 92 
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Ala 


Ala 


Ser 


Ala 


Ala 


Thr 


Ala 


Thr 


1 








5 








Gly 


Glu 


Val 


He 
20 


Ser 


Val 


His 


Ser 


Glu 


Glu 


Ala 
35 


Asn 


Ala 


Ala 


Lys 


Lys 
40 


Ser 


Trp 
50 


Cys 


Gly 


Pro 


Cys 


Arg 
55 


He 


Ala 


Lys 


Lys 


Phe 


Pro 


Ala 


Ala 


Val 


65 








70 






Leu 


Lys 


Pro 


He 


Ala 
85 


Glu 


Gin 


Phe 


Leu 


Phe 


Met 


Lys 
100 


Glu 


Gly 


Asp 


Val 


Lys 


Glu 


Glu 


Leu 


Thr 


Thr 


Lys 


Val 






115 








120 



<210> 93 
<211s» 109 
<212> PRT 

<213> Aspergillus nidulans 



<400> 93 



Gly Ala 


Ser 


Glu 


His 


Val 


Pro 


Pro 


1 






5 








Glu Lys 


Val 


Leu 


Asn 


Ala 


Lys 


Gly 






20 










Thr Trp 


Cys 


Gly 


Pro 


Cys 


Lys 


Ala 




35 










40 


Ala Gin 


Thr 


Tyr 


Thr 


Asp 


Ala 


Ser 


50 










55 




Leu Ser 


Glu 


Val 


Ala 


Ala 


Glu 


Leu 


65 








70 






Leu Leu 


Phe 


Lys 


Asp 


Gly 


Gin Lys 








85 








Pro Gly 


Ala 


Leu 


Glu 


Ala 


Gly 


He 




100 











<210> 94 

<211> 105 

<212> PRT 

<213> Alicyclobacillus 



<400> 94 



Ala 


Thr 


Met 


Thr 


Leu 


Thr 


Asp 


Ala 


1 








5 








Asp 


Lys 


Pro 


Val 
20 


Leu 


Val 


Asp 


Phe 


Arg 


Met 


Met 
35 


Ala 


Pro 


Val 


Leu 


Glu 
40 


Lys 


Val 
50 


Thr 


Val 


Ala 


Lys 


Leu 
55 


Asn 


Ser 


Gin 


Phe 


Gly 


He 


Met 


Ser 


He 


65 










70 






Gly Arg 


Pro 


Val 


Lys 


Gin 


Leu 


He 










85 








Glu 


Ala 


Gin 


Leu 
100 


Ala 


Asp 


Val 


Leu 



<210> 95 

<211> 91 

<212> PRT 

<213> Archaeoglobus fulgidus 



Ala 


Thr 


Ala 


Ala 


Ala 


Val 


Gly 


Ala 




10 










15 




Leu 


Glu 


Gin 


Trp 


Thr 


Met 


Gin 


He 


25 










30 






Leu 


Val 


Val 


He 


Asp 


Phe 


Thr 


Ala 










45 








Met 


Ala 


Pro 


He 


Phe 


Ala 


Asp 


Leu 








60 










Phe 


Leu 


Lys 


Val 


Asp 


Val 


Asp 


Glu 






75 










80 


Ser 


Val 


Glu 


Ala 


Met 


Pro 


Thr 


Phe 




90 










95 




Lys 


Asp 


Arg 


Val 


Val 


Gly 


Ala 


He 


105 










110 






Gly Leu His 


Ala 


Ala 


Gin 














125 









He 


Thr 


Ser 


Lys 


Ala 


Glu 


Phe 


Gin 




10 








15 




Phe 


Val 


Val 


Val 


Asp 


Cys 


Phe 


Ala 


25 










30 






He 


Ala 


Pro 


Thr 


Val 


Glu 


Lys 


Phe 










45 








Phe- Tyr 


Gin 


He 
60 


Asp 


Val 


Asp 


Glu 


Gly 


He 


Arg 


Ala 


Met 


Pro 


Thr 


Phe 






75 










80 


Val 


Ser 


Asp 


Val 


Val Gly 


Ala 


Asn 




90 










95 




Lys 


Ala 


Leu 


Leu 


Ala 









105 



Asn 


Phe 


Gin 


Gin 


Ala 


He 


Gin Gly 




10 










15 




Trp 


Ala 


Ala 


Trp 


Cys 


Gly 


Pro 


Cys 


25 










30 






Glu 


Phe 


Ala 


Glu 


Ala 


His 


Ala 


Asp 










45 








Val 


Asp 


Glu 


Asn 
60 


Pro 


Glu 


Thr 


Thr 


Pro 


Thr 


Leu 


He 


Leu 


Phe 


Lys 


Gly 






75 










80 


Gly 


Tyr 


Gin 


Pro 


Lys 


Glu 


Gin 


Leu 




90 










95 





Gin 



105 
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<400> 95 
Met Val Met Met 
1 

Lys Ala Glu Lys 

20 

Ala He Asn Leu 
35 

Lys Phe Gly lie 
50 

Leu He Leu Gly 
65 

Lys Leu Lys Gly 



Lys Leu Phe Thr 
5 

Val Val Ser Lys 

Pro Val Asn Thr 

40 

Arg Gly Val Pro 
55 

Val Pro Asp Glu 
70 

Gly Glu Glu Tyr 
85 



Ser Pro Thr Cys 
10 

Val Ala Lys Glu 
25 

Asp Glu Gly Leu 

Ala Leu Val He 

60 

Gly Glu Leu Arg 
75 

Gly Ala Ser 
90 



Pro Tyr Cys Pro 
15 

Glu Gly Val Leu 
30 

Lys Glu Ala Leu 
45 

Asn Asp Lys Tyr 

Gin Leu He Arg 

80 



<210> 96 
<211> 103 
<212> PRT 

<213> Bacillus subtilis 



<400> 96 
Ala He Val Lys 
1 

Gly Val Val Leu 

20 

Met He Ala Pro 
35 

Leu Lys He Val 
50 

Lys Tyr Gly Val 
65 

Glu Val Val Glu 

Glu Leu Val Asn 

100 



Ala Thr Asp Gin 
5 

Ala Asp Phe Trp 

Val Leu Glu Glu 

40 

Lys He Asp Val 
55 

Met Ser He Pro 
70 

Thr Ser Val Gly 
85 

Lys His Leu 



Ser Phe Ser Ala 
10 

Ala Pro Trp Cys 
25 

Leu Asp Gin Glu 

Asp Glu Asn Gin 

60 

Thr Leu Leu Val 
75 

Phe Lys Pro Lys 
90 



Glu Thr Ser Glu 
15 

Gly Pro Cys Lys 
30 

Met Gly Asp Lys 
45 

Glu Thr Ala Gly 

Leu Lys Asp Gly 

80 

Glu Ala Leu Gin 
95 



<210> 97 
<211> 87 
<212> PRT 

<213> Bacteriophage T4 



<400> 97 
Met Phe Lys Val 
1 

Cys Asp Asn Ala 

20 

Phe He Asn He 
35 

Ala Glu Leu Leu 
50 

Met Pro Gin Val 
65 

Gin Leu Arg Glu 



Tyr Gly Tyr Asp 
5 

Lys Arg Leu Leu 

Met Pro Glu Lys 

40 

Thr Lys Leu Gly 
55 

Phe Ala Pro Asp 
70 

Tyr Phe Lys 
85 



Ser Asn He His 
10 

Thr Val Lys Lys 
25 

Gly Val Phe Asp 

Arg Asp Thr Gin 

60 

Gly Ser His He 
75 



Lys Cys Val Tyr 
15 

Gin Pro Phe Glu 
30 

Asp Glu Lys He 
45 

He Gly Leu Thr 

Gly Gly Phe Asp 

80 



<210> 98 
<211> 117 
<212> PRT 

<213> Borrelia burgdorferi 



Met°Ala 8 Ile Ser Leu Thr Glu Glu Asp Phe Val Val Lys Val Phe Asp 

1 5 10 I 5 

Tvr Lys Asn Asp Lys Glu Trp Ser Phe Arg Gly Asp Arg Pro Ala He 

20 25 3° 
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lie 


Asp 


Phe 


Tyr 






35 




He 


Phe 


Glu 


Lys 




50 






Lys 


Val 


Asp 


Thr 


65 








Gin 


Ser 


Leu 


Pro 


Val 


Ser 


Val 


Gly 








100 


Asp 


Phe 


Phe 


Gly 






115 





Ala Asn Trp Cys 

40 

Leu Ser Lys Lys 
55 

Asp Lys Glu Gin 
70 

Thr He Leu Phe 
85 

Phe Leu Gin Glu 
Phe 



Gly Pro Cys Lys 

Tyr Glu Asn Ser 

60 

Asp He Ser Ser 
75 

He Pro Val Asp 
90 

Asp Ala Phe Glu 
105 



Met Leu Ser Pro 
45 

He Asp Phe Tyr 

Ala He Gly Val 

80 

Gly Lys Pro Lys 
95 

Asn He He Lys 
110 



<210> 99 
<211> 108 
<212> PRT 

<213> Buchnera aphidicola 

Met°Asn 9 Lys He He Glu Leu Thr Asp Gin Asn Phe Glu Glu Gin Val 

1 5 10 15 

Leu Asn Ser Lys Ser Phe Phe Leu Val Asp Phe Trp Ala Gin Trp Cys 

20 25 30 

Asn Pro Cys LyB He Leu Ala Pro He Leu Glu Glu He Ser Lys Glu 

35 40 45 

Tyr Ser Asn Lys Val He Val Gly Lys Leu Asn He Glu Glu Asn Pro 

50 55 60 

Asn Thr Ala Pro Val Tyr Ser He Arg Ser He Pro Thr Leu Leu Leu 
65 70 75 80 

Phe Asn Asn Ser Glu Val Leu Ala Thr Lys Val Gly Ala Val Ser Lys 

85 90 95 

Leu Glu Leu Lys Glu Phe Leu Asp Glu Asn lie Asn 

100 105 



<210> 100 
<211> 108 
<212> PRT 
<213> aphidicola 

Met°Asn°Lys He He Glu Leu Thr Asp Gin Asn Phe Glu Lys Glu Val 

1 5 1° 15 " 

Leu Glu His Lys Ser Phe Val Leu Val Asp Phe Trp Ala Glu Trp Cys 

20 25 30 

Asn Pro Cys Lys He Leu Ala Pro He Leu Glu Glu He Ala Gin Glu 

35 " 40 45 

Tyr Phe Asn Lys He Lys Val Gly Lys Leu Asn He Glu Lys Asn Pro 

50 55 60 

Asn Thr Ala Pro lie Tyr Ser He Arg Gly He Pro Ala Leu Leu Leu 
65 70 75 80 

Phe His Gly Arg Glu Val Leu Ala Thr Lys Val Gly Ala He Ser Lys 

85 90 95 

Leu Gin Leu Lys Asp Phe Leu Asp Glu Asn He Lys 

100 105 

if 

<210> 101 
<211> 108 
<212> PRT 

<213> Chlorobium limicola 
<220> 

<222> If^ 1 ^ 38, 42, 45, 54, 55, 58, 66, 72, 75, 79, 80, 81, 94, 
99, 103 
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<223> Xaa = Any 

<400> 101 
Ala Gly Lys Tyr 
1 

Xaa Asp Ser Asp 

20 

Gly Pro Cys Met 
35 

Tyr Glu Gly Lys 
50 

Asn Xaa Ala Gly 
65 

Xaa Lys Gly Gly 

Asn Met Xaa Ala 

100 



Amino Aeid 



Phe Glu Ala Thr 
5 

Lys Ala Val Leu 

Met Xaa Gly Pro 

40 

Ala Xaa Xaa Ala 
55 

Gin Tyr Gly Xaa 
70 

Lys Val Val Asp 
85 

Lys Lys Xaa Asp 



Asp Lys Asn Phe 
10 

Val Asp Phe Trp 
25 

Val Xaa Glu Gin 

Lys Xaa Asn Val 

60 

Arg Ser Xaa Pro 
75 

Gin Met Val Gly 
90 

Glu His He Gly 
105 



Gin Thr Glu Xaa 
15 

Ala Ser Trp Cys 
30 

Xaa Ala Asp Asp 
45 

Asp Glu Asn Pro 

Thr Met Xaa Xaa 

80 

Ala Xaa Pro Lys 
95 



<210> 102 
<211> 102 
<212> PRT 
<213> Chlamydia 

<400> 102 
Met Val Gin He 
1 

Gly Leu Val Leu 

20 

Met Leu Thr Pro 
35 

Thr He Leu Lys 
50 

Phe Gly Val Ser 
65 

Glu Val Glu Arg 

Leu He Ser Lys 

100 



muridarum 



Val Ser Gin Asp 
5 

Val Asp Phe Phe 

Val Leu Glu Ala 

40 

Leu Asp He Asp 
55 

Ser He Pro Thr 
70 

Ser Val Gly Leu 
85 

His Gin 



Asn Phe Ala Asp 
10 

Ala Glu Trp Cys 
25 

Leu Ala Ala Glu 

Ala Ser Pro Arg 

60 

Leu He Leu Phe 
75 

Lys Asp Lys Asp 
90 



Ser lie Ala Ser 
15 

Gly Pro Cys Lys 
30 

Leu Pro Tyr Val 
45 

Pro Ala Glu Gin 

Lys Asp Gly Lys 

80 

Ser Leu Val Lys 
95 



<210> 103 
<211> 102 
<212> PRT 
<213> Chlamydia 

<400> 103 
Met Val Lys He 
1 

Gly Leu Val Leu 

20 

Met Leu Thr Pro 
35 

Thr He Gly Lys 
50 

Tyr Glu Val Ser 
65 

Glu Val Ala Arg 

Leu He Asn Lys 

100 



pneumoniae 



He 


Ser 


Ser 


Glu 


5 








Val 


Asp 


Phe 


Phe 


He 


Leu 


Glu 


Asn 








40 


He 


Asn 


He 


Asp 






55 




Ser 


He 


Pro 


Thr 




70 






Val 


Val 


Gly Leu 


85 








His 


Ala 







Asn Phe Asp Ser 
10 

Ala Glu Trp Cys 
25 

Leu Ala Ala Glu 

Glu Asn Ser Lys 

60 

Leu He Leu Phe 
75 

Lys Asp Lys Glu 
90 



Phe lie Ala Ser 
15 

Gly Pro Cys Arg 
30 

Leu Pro His Val 
45 

Pro Ala Glu Thr 

Lys Asp Gly Asn 

8 0 

Phe Leu Thr Asn 
95 



<210> 104 
<211> 102 
<212> PRT 
<213> Psittaci 
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<400> 104 
Met Val Lys Val 
1 

Gly Leu Val Leu 

20 

Met Leu Thr Pro 
35 

Leu He Gly Lys 
50 

Tyr Gly Val Ser 
65 

Glu Val Asp Arg 

Leu He Asn Gin 

100 



Val Ser Ala Glu 
5 

lie Asp Phe Phe 

Val Leu Glu Ser 

40 

Val Asn He Asp 
55 

Ser He Pro Thr 
70 

Val Val Gly Leu 
85 

His Ser 



Asn Phe Asn Ser 
10 

Ala Glu Trp Cys 
25 

Leu Glu Ala Glu 

Asp His Pro Ala 

60 

Leu He Leu Phe 
75 

Lys Asp Lys Asp 
90 



Phe He Ala Thr 
15 

Gly Pro Cys Lys 
30 

Val Ser Ser Val 
45 

Pro Ala Glu Gin 

Lys Asp Gly Lys 

80 

Ser Leu He Arg 
95 



<210> 105 
<211> 102 
<212> PRT 

<213> Chlamydia trachomatis 



<400> 105 
Met Val Gin Val 
1 

Gly Leu Val Leu 

20 

Met Leu Thr Pro 
35 

Thr He Leu Lys 
50 

Tyr Ser Val Ser 
65 

Glu Val Glu Arg 

Leu He Ser Lys 

100 



Val Ser Gin Glu 
5 

He Asp Phe Phe 

Val Leu Glu Ala 

40 

Val Asp He Asp 
55 

Ser He Pro Thr 
70 

Ser Val Gly Leu 
85 

His Gin 



Asn Phe Ala Asp 
10 

Ala Glu Trp Cys 
25 

Leu Ala Ala Glu 

Ser Ser Pro Arg 

60 

Leu He Leu Phe 
75 

Lys Asp Lys Asp 
90 



Ser He Ala Ser 
15 

Gly Pro Cys Lys 
30 

Leu Pro His Val 
45 

Pro Ala Glu Gin 

Lys Asp Gly Lys 

80 

Ser Leu He Lys 
95 



<210> 106 • 
<211> 105 
<212> PRT 

<213> Cornybacterium nephridii 



<400> 106 




Ala 


Thr 


Val 


Lys 


1 








Ser 


Ser 


Glu 


Pro 








20 


Cys 


Lys 


Met 


He 






35 




Gly 


Gin 


Val 


Lys 




50 






Ala 


Ala 


Gin 


Phe 


65 








Asp 


Gly Glu 


Leu 


Leu 


Ala 


Asp 


Trp 








100 



Val 


Asp 


Asn 


Ser 


5 








Val 


Val 


Val 


Asp 


Ala 


Pro 


Ala 


Leu 








40 


He 


Ala 


Lys 


Val 






55 




Gly 


Val 


Arg 


Ser 




70 






Ala 


Ala 


Asn 


Met 


85 








He 


Lys 


Ala 


Ser 



Asn Phe Gin Ser 
10 

Phe Trp Ala Glu 
25 

Asp Glu He Ala 

Asn He Asp Glu 

60 

He Pro Thr Leu 
75 

Val Gly Ala Ala 
90 

Ala 
105 



Asp Val Leu Gin 
15 

Trp Cys Gly Pro 
30 

Thr Glu Met Ala 
45 

Asn Pro Glu Leu 

Leu Met Phe Lys 

80 

Pro Lys Ser Arg 
95 



<210> 107 
<211> 107 
<212> PRT 

<213> Cornybacterium nephridii 
<400> 107 
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Ser 


Ala 


Thr 


lie 


1 








Leu Asp 


Ala 


LsJ-U 








20 


Ala 


Pro 


Cys 


Lys 






35 




Tyr 


Ala 


Gly 


Lys 




50 






Asp 


Thr 


Ala 


Val 


65 








Phe 


Lys 


Asp 


Gly 


Ser 


Lys 


Leu 


Ala 








100 



Val 


Asn 


Thr 


Thr 


5 








Thr 


Pro 


Val 


Leu 


Ala 


He 


Ala 


Pro 








40 


Val 


Lys 


He 


Val 






55 




Lys 


Tyr 


Asn 


He 




70 






Glu 


Val 


Val 


Ala 


85 








Ala 


Phe 


He 


Asp 



Asp Glu Asn Phe 
10 

Val Asp Phe Trp 
25 

Val Leu Glu Glu 

Lys Val Asp Val 

60 

Arg Asn He Pro 
75 

Gin Gin Val Gly 
90 

Gin Asn He 
105 



Gin Ala Asp Val 
15 

Ala Gly Trp Cys 
30 

Leu Ser Asn Glu 
45 

Thr Ser Cys Glu 

Ala Leu Leu Met 

80 

Ala Ala Pro Arg 
95 



<210> 108 
<211> 145 
<212> PRT 

<213> Cornybacterium nephridii 



<400> 108 
Met He He Val 
1 

Glu Lys Leu Ala 

20 

Leu Pro Leu Glu 
35 

He Ser Asn Ser 

50 

Cys Gly Pro Cys 
65 

Gin Asn Pro Tyr 

Arg Leu Ser Ala 

10 0 

Met Asn Lys Thr 
115 

Leu Glu Leu Gin 
130 

Asn 
145 



Cys Ala Ser Cys 
5 

Val His Pro Asn 

Pro He Glu Leu 

40 

Asp Leu Pro Val 
55 

Lys Met Met Ala 
70 

Val Val Phe Ala 
85 

Ala Phe Asn Val 

Thr Glu Val Ala 

120 

Gin Trp Leu Asp 
135 



Gly Ala Lys Asn 
10 

Cys Gly Gin Cys 
25 

Asn Glu Gin Asn 

Leu He Asp Leu 

60 

Pro His Phe Ala 
75 

Lys He Asp Thr 
90 

Arg Ser He Pro 
105 

Arg He Ser Gly 

Gin Gin Leu Gin 

140 



Arg Val Pro Glu 
15 

His Gin Ala Leu 
30 

Phe Ser Asn Phe 
45 

Trp Ala Glu Trp 

Gin Val Ala Lys 

80 

Glu Ala Asn Pro 
95 

Thr Leu Val Leu 
110 

Ala Leu Arg . Thr 
125 

Gin Gin Gin Gly 



<210> 109 
<211> 107 
<212> PRT 

<213> Chroma tium vinos um 



<220> 

<221> VARIANT 

<222> 17, 38, 42, 55, 58, 60, 72, 107 
<223> Xaa = Any Amino Acid 



<400> 109 




Ser 


Asp 


Ser 


He 


1 








Xaa 


Lys 


Ser 


Pro 






20 


Gly 


Pro 


Cys 


Lys 






35 




Tyr 


Ala 


Gly 


Arg 




50 






Asn 


Thr 


Pro 


Pro 


65 








Phe 


Arg 


Gly 


Gly 



Val His Val Thr 
5 

Asp Pro Val Leu 

Met Xaa Ala Pro 

40 

Val Lys Xaa Ala 
55 

Arg Tyr Gly Xaa 
70 

Glu Val Glu Ala 
85 



Asp Asp Ser Phe 
10 

Val Asp Tyr Trp 
25 

Val Xaa Asp Glu 

Lys Xaa Asn Xaa 

60 

Arg Gly He Pro 
75 

Thr Lys Val Gly 
90 



Glu Glu Glu Val 
15 

Ala Asp Trp Cys 
30 

He Ala Asp Glu 
45 

Asp Glu Asn Pro 

Thr Leu Met Leu 

80 

Ala Val Ser Lys 
95 
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Ser Gin Leu Thr Ala Phe Leu Asp Ser Asn Xaa 

100 105 



<210> 110 
<211> 107 
<212> PRT 

<213> Clostridium litorale 
<400> 110 

et Leu Aso Lvs Asp Thr Phe Lys Thr Glu Val Leu Glu Gl: 

15 

Gly Cys Val Pro Cy: 
3 0 

Lys Lys Tyr Glu Gl; 
45 

Ala Arg Arg Leu Al< 
60 

Ser Leu Tyr Lys As] 

80 

Ala Thr He Glu As: 

95 



Met 


Leu 


Met 


Leu 


Asp 


Lys 


Asp 


Thr 


Phe 


Lys 


Thr 


1 








5 










10 




Thr 


Gly 


Tyr 


Val 


Leu 


Val 


Asp 


Tyr 


Phe 


Ser 


Asp 




20 










25 






Lys 


Ala 


Leu 


Met 


Pro 


Ala 


Val 


Glu 


Glu 


Leu 


Ser 




35 










40 








Arg 


Val 


Val 


Phe 


Ala 


Lys 


Leu 


Asn 


Thr Thr Gly 


50 










55 










He 


Ser 


Gin 


Lys 


He 


Leu 


Gly 


Leu 


Pro 


Thr 


Leu 


65 








70 










75 


Gly 


Val 


Lys 


Val 


Asp 


Glu 


Val 


Thr 


Lys 


Asp 


Asp 






85 










90 




He 


Glu 


Ala 


Met 


Val 


Glu 


Glu 


His 


He 


Ser 


Lys 








100 










105 






<210> 111 



















<211> 40 
<212> PRT 

<213> Clostridium sporogenes 

Met Val Leu Asp Lys Lys Thr Phe Glu Glu Glu Val Leu Lys Thr 

1 5 10 15 

Lys Gly Tyr Val Leu Val Asp Tyr Phe Gly Asp Gly Cys Val Pro Cys 

20 25 30 

Glu Ala Leu Met Pro Asp Val Glu 
35 40 



<210> 112 
<211> 33 
<212> PRT 

<213> Clostridium sticklandii 

<400> 112 , _ „ - « 

Met Phe Glu Leu Asp Lys Asp Thr Phe Glu Thr Glu Val Leu Gin Gly 

x 5 ' ' 10 15 

Thr Gly Tyr Val Leu Val Asp Phe Trp Ser Glu Gly Cys Glu Pro Cys 

20 25 30 

Lys 



<210> 113 
<211> 106 
<212> PRT 

<213> Coprinus comatus 
<400> 113 

Met Val Gin Val He Ser Asn Leu Asp Glu Phe Asn Lys Leu Thr Asn 

1 5 10 15 

Ser Gly Lys He He He He Asp Phe Trp Ala Thr Trp Cys Gly Pro 

20 25 30 

Cys Arg Val He Ser Pro He Phe Glu Lys Phe Ser Glu Lys Tyr Gly 

35 40 45 

Ala Asn Asn He Val Phe Ala Lys Val Asp Val Asp Thr Ala Ser Asp 
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50 

He Ser Glu Glu Ala 
65 

Lys Asp Gly Gin Lys 

85 

Leu Glu Ser Leu Val 

100 



55 

Lys He Arg Ala Met 
70 

He Asp Glu Leu Val 

90 

Gin Lys Ser Leu Ala 

105 



60 

Pro Thr Phe Gin Val Tyr 
75 80 
Gly Ala Asn Pro Thr Ala 

95 



<210> 114 
<211> 105 
<212> PRT 

<213> Dictyostelium discoideum 



<400> 114 




Met 


Ser 


Asn 


Arg 


1 








His 


Leu 


Arg 


Asp 








20 


Gly 


Pro 


Cys 


Arg 






35 




Phe 


He 


Thr 


Phe 




50 






His 


Pro 


He 


Val 


65 








Arg 


Asn 


Gly 


Ser 


Leu 


Arg 


Ser 


Thr 








100 



Val He His Val 
5 

Glu Arg Val Val 

Ala He Ser Pro 

40 

Thr Phe Leu His 
55 

Ser Lys He Lys 
70 

Lys Val Ser Glu 
85 

Leu Glu Ala Asn 



Ser Ser Cys Glu 
10 

Val Asp Phe Ser 
25 

Val Phe Glu Lys 

Val Asp He Asp 

60 

Ser Val Pro Thr 
75 

Phe Ser Gly Ala 
90 

Lys 
105 



Glu Leu Asp Lys 
15 

Ala Val Trp Cys 
30 

Leu Ser Asn Glu 
45 

Lys Leu Asn Val 

Phe His Phe Tyr 

80 

Ser Glu Ser He 
95 



<210> 115 
<211> 88 
<212> PRT 

<213> Dictyostelium discoideum 



<400> 115 
Met Ser Arg Val 
1 

Leu Gin Ala Glu 

20 

Pro Cys Arg Ala 
35 

Val Thr Phe Thr 
50 

Pro He Val Lys 
65 

Asn Gly Ala Lys 



He His He Ser 
5 

Arg Leu Val He 

He Ser Pro Val 

40 

Phe Val His Val 
55 

Glu He Arg Ser 
70 

Val Ser Glu Phe 
85 



Ser Asn Glu Glu' 
10 

Asp Phe Ser Ala 
25 

Phe Glu Lys Leu 

Asp He Asp Lys 

60 

Val Pro Thr Phe 
75 



Leu Asp Lys His 
15 

Ala Trp Cys Gly 
30 

Ser Asn Glu Phe 
45 

Leu Ser Gly His 

Tyr Phe Tyr Arg 

80 



<210> 116 
<211> 88 
<212> PRT 

<213> Dictyostelium discoideum 



Me^Se^Arq Val He His He Ser Ser Asn Glu Glu Leu Asp Lys His 

1 5 10 15 

Leu Gin Ala Glu Arg Leu Val He Asp Phe Ser Ala Ala Trp Cys Gly 

20 25 30 

Pro Cys Arg Ala He Ser Pro Val Phe Glu Lys Leu Ser Asn Glu Phe 



3 5 40 45 

Val Thr Phe Thr Phe Val His Val Asp He Asp Lys Leu Ser Gly His 

Pro He Val Lys Glu He ^g Ser Val Pro Thr Phe Tyr Phe Tyr Arg 
65 70 75 80 
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Asn Gly Ala Lys Val Ser Glu Phe 

85 



<210> 117 
<211> 108 
<212> PRT 

<213> E coli, salmonella typhimurium 



<400> 117 




Ser 


Asp 


Lys 


lie 


1 








Leu 


Lys 


Ala 


Asp 








20 


Gly 


Pro 


Cys 


Lys 






35 




Tyr 


Gin 


Gly 


Lys 




50 






Gly 


Thr 


Ala 


Pro 


65 








Phe 


Lys 


Asn 


Gly 


Gly 


Gin 


Leu 


Lys 






100 



lie His Leu Thr 
5 

Gly Ala He Leu 

Met He Ala Pro 

40 

Leu Thr Val Ala 
55 

Lys Tyr Gly He 
70 

Glu Val Ala Ala 
85 

Glu Phe Leu Asp 



Asp Asp Ser Phe 
10 

Val Asp Phe Trp 
25 

lie Leu Asp Glu 

Lys Leu Asn He 

60 

Arg Gly lie Pro 
7 5 

Thr Lys Val Gly 
90 

Ala Asn Leu Ala 
105 



Asp Thr Asp Val 
15 

Ala Glu Trp Cys 
30 

lie Ala Asp Glu 
45 

Asp Gin Asn Pro 

Thr Leu Leu Leu 

80 

Ala Leu Ser Lys 
95 



<210> 118 
<211> 105 
<212> PRT 

<213> Synechocystis 



<400> 118 
Met Ala Val Lys 
1 

Ser Pro Lys Pro 

20 

Cys Gin Met Met 
35 

Gin Gin He Gin 
50 

Ala Thr Gin Tyr 
65 

Gin Gly Gin Pro 

Leu He Gin Gin 

100 



Lys Gin Phe Ala 
5 

Val Leu Val Asp 

Ala Pro He Leu 

40 

Val Val Lys He 
55 

Gin He Gin Ser 
70 

Val His Arg Met 
85 

Leu Gin Val Phe 



Asn Phe Ala Glu 

10 

Phe Tyr Ala Thr 
25 

Glu Gin Val Gly 

Asp Thr Asp Lys 

60 

Leu Pro Thr Leu 
75 

Glu Gly Val Gin 
90 

Val 
105 



Met Leu Ala Gly 

15 

Trp Cys Gly Pro 
30 

Ser His Leu Arg 
45 

Tyr Pro Ala He 

Val Leu Phe Lys 

80 

Gin Ala Ala Gin 
95 



<210> 119 
<211> 139 
<212> PRT 
<213 > E . coli 



<400> 119 - 

Met Asn Thr Val Cys Thr His Cys Gin Ala He Asn Arg He Pro Asp 

! 5 10 15 

Asp Arg He Glu Asp Ala Ala Lys Cys Gly Arg Cys Gly His Asp Leu 

20 25 30 

Phe Asp Gly Glu Val He Asn Ala Thr Gly Glu Thr Leu Asp Lys Leu 

35 40 45 

Leu Lys Asp Asp Leu Pro Val Val He Asp Phe Trp Ala Pro Trp Cys 

50 55 60 

Gly Pro Cys Arg Asn Phe Ala Pro He Phe Glu Asp Val Ala Gin Glu 
65 70 75 80 

Arq Ser Gly Lys Val Arg Phe Val Lys Val Asn Thr Glu Ala Glu Arg 

85 90 95 

Glu Leu Ser Ser Arg Phe Gly He Arg Ser He Pro Thr He Met He 
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100 

Phe Lys Asn Gly 
115 

Ala Pro Phe Asp 
130 



Gin Val Val Asp 

120 

Ser Trp Leu Asn 
135 



105 

Met Leu Asn Gly 
Glu Ser Leu 



110 

Ala Val Pro Lys 
125 



<210> 120 
<211> 110 
<212> PRT 

<213> Eubacterium acidaminophilum 
<400> 120 



Met 


Ser 


Ala 


Leu 


Leu 


Val 


Glu 


He 


Asp 


Lys 


Asp 


Gin 


Phe 


Gin 


Ala 


Glu 


1 








5 










10 










15 




Val 


Leu 


Glu 


Ala 


Glu 


Gly 


Tyr 


Val 


Leu 


Val 


Asp 


Tyr 


Phe 


Ser 


Asp 


Gly 








20 






25 










30 






Cys 


Val 


Pro 


Cys 


Lys 


Ala 


Leu 


Met 


Pro 


Asp 


Val 


Glu 


Glu 


Leu 


Ala 


Ala 




35 






40 










45 








Lys 


Tyr 


Glu 


Gly 


Lys 


Val 


Ala 


Phe 


Arg 


Lys 


Phe 


Asn 


Thr 


Ser 


Ser 


Ala 


50 










55 










60 








Thr 


Arg 


Arg 


Leu 


Ala 


lie 


Ser 


Gin 


Lys 


He 


Leu 


Gly 


Leu 


Pro 


Thr 


He 


65 








70 










75 










80 


Leu 


Tyr 


Lys 


Gly 


Gly 


Gin 


Lys 


Val 


Glu 


Glu 


Val 


Thr 


Lys 


Asp 


ASp 


Ala 






85 










90 










95 




Thr 


Arg 


Glu 


Asn 


He 


Asp 


Ala 


Met 


He 


Ala 


Lys 


His 


Val 


Gly 










100 








105 










110 







<210> 121 
<211> 107 
<212> PRT 

<213> Haemophilus influenzae 
<400> 121 



Met 


Ser 


Glu 


Val 


Leu 


His 


He 


Asn 


Asp 


Ala Asp 


Phe 


Glu 


Ser 


Val 


Val 


1 








5 










10 








15 




Val 


Asn 


Ser 


Asp 


He 


Pro 


He 


Leu 


Leu 


Asp Phe 


Trp 


Ala 


Pro 


Trp 


Cys 








20 










25 








30 






Gly 


Pro 


Cys 


Lys 


Met 


He 


Ala 


Pro 


Val 


Leu Asp 


Glu 


Leu 


Ala 


Pro 


Glu 




35 








40 








45 








Phe 


Ala 


Gly 


Lys 


Val 


Lys 


He 


Val 


Lys 


Met Asn 


Val 


Asp 


Asp 


Asn 


Gin 




50 








55 








60 










Ala 


Thr 


Pro 


Ala 


Gin 


Phe 


Gly Val 


Arg 


Ser He 


Pro 


Thr 


Leu 


Leu 


Leu 


65 










70 








75 










80 


He 


Lys 


Asn 


Gly 


Gin 


Val 


Val 


Ala 


Thr 


Gin Val 


Gly 


Ala 


Leu 


Pro 


Lys 






85 










90 








95 




Thr 


Gin 


Leu 


Ala 


Asn 


Phe 


He 


Asn 


Gin 


His He 













100 105 



<210> 122 
<211> 167 
<212> PRT 

<213> Haemophilus influenzae 
<400> 122 

Met Lys He Lys Lys Leu Leu Lys 

1 ~ 5 

Phe He Val He Thr Ser He Leu 

20 

Pro Glu Glu lie Asn Lys He Thr 

35 40 
Phe Ser Leu Glu Ser Leu Asp Gin 

50 55 
Trp Gly Thr Trp Cys Gly Tyr Cys 
65 ~ 70 



Asn Gly Leu Ser Leu Phe Leu Thr 

10 15 
Asp Phe Val Arg Arg Pro Val Val 
25 30 
Leu Gin Asp Leu Gin Gly Asn Thr 

45 

Asn Lys Pro Thr Leu Leu Tyr Phe 

60 

Arg Tyr Thr Ser Pro Ala He Asn 
75 8 0 
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Ser Leu Ala Lys 

Gly Asn Glu Ala 

100 

Phe Thr Thr Val 
115 

lie Asn Val Thr 
130 

Leu Val Thr Thr 
145 

Phe Phe Ala Glu 



Glu Gly Tyr Gin 
85 

Asp Val Asn Asp 

Asn Asp Pro Lys 

120 

Pro Thr lie Val 
135 

Gly Leu Thr Ser 
150 

Phe Phe Gly 
165 



Val Val Ser Val 
90 

Tyr Leu Ser Lys 
105 

Gly Glu Phe Ala 

Leu Leu Ser Lys 

140 

Tyr Trp Gly Leu 
155 



Ala Leu Arg Ser 
95 

Asn Asp Tyr His 
110 

Glu Arg Trp Gin 
125 

Gly Lys Met Asp 

Lys Val Arg Leu 

160 



<210> 123 
<211> 106 
<212> PRT 

<213> Helicobacter pylori 



<400> 123 




Met 


Ser 


His 


Tyr 


1 








Lys 


Lys 


Gly 


Val 








20 


Cys 


Lys 


Met 


Leu 






35 




Gly 


Lys 


Ala 


Lys 


50 






Ser 


Ala 


Lys 


Phe 


65 








Asp 


Gly 


Glu 


Val 


Leu 


Lys 


Glu 


Gin 






100 



lie Glu Leu Thr 
5 

Ala Leu Val Asp 

Ser Pro Val He 

40 

He Cys Lys Val 
55 

Gly He Arg Ser 
70 

Val His Gin Leu 
85 

Leu Asn Lys Leu 



Glu Glu Asn Phe 
10 

Phe Trp Ala Pro 
25 

Asp Glu Leu Ala 

Asn Thr Asp Glu 

60 

He Pro Thr Leu 
75 

Val Gly Val Gin 
90 

Leu Gly 
105 



Glu Ser Thr He 
15 

Trp Cys Gly Pro 
30 

Ser Glu Tyr Glu 
45 

Gin Glu Glu Leu 

Leu Phe Thr Lys 

80 

Thr Lys Val Ala 
95 



.<210> 124 
<211> 103 
<212> PRT 

<213> Listeria monocytogenes 
<400> 124 



Met 


Val 


Lys 


Glu 


He 
5 


Thr 


Asp 


Ala 


1 
Gly 


Leu 


Val 


Leu 


Thr 


Asp 


Phe 


Trp 






20 










Met 


Val 


Ala 


Pro 


Val 


Leu 


Glu 


Glu 






35 










40 


Leu 


Lys 


He 


Val 


Lys 


Met 


Asp 


Val 




50 










55 




Ser 


Phe 


Gly 


Val 


Met 


Ser 


He 


Pro 


65 








70 






Glu 


Val 


Val 


Glu 


Thr 


He 


He 


Gly 










85 








Glu 


Val 


He 


Asn 


Lys 


Tyr 


Val 










100 











Thr Phe Glu Gin Glu Thr Ser Glu 

10 15 
Ala Thr Trp Cys Gly Pro Cys Arg 
25 30 
He Gin Glu Glu Arg Gly Glu Ala 

45 

Asp Glu Asn Pro Glu Thr Pro Gly 

60 

Thr Leu Leu He Lys Lys Asp Gly 

75 80 
Tyr Arg Pro Lys Glu Glu Leu Asp 
90 35 



<210> 125 
<211> 85 
<212> PRT 

<213> Methoanococus jannaschu 



Met°Ser 2 Lys Val Lys He Glu Leu Phe Thr Ser Pro Met Cys Pro His 

1 5 10 

Cys Pro Ala Ala Lys Arg Val Val Glu Glu Val Ala Asn Glu Met Pro 
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20 

Asp Ala Val Glu Val 
35 

Ala Met Glu Tyr Gly 
50 

Asp Val Glu Phe lie 
65 

lie Lys Lys Arg Leu 

85 



25 

Glu Tyr lie Asn Val Met 
40 

He Met Ala Val Pro Thr 
55 

Gly Ala Pro Thr Lys Glu 
70 75 



30 

Glu Asn Pro Gin Lys 
45 

He Val He Asn Gly 
SO 

Ala Leu Val Glu Ala 

80 



<210> 126 
<211> 102 
<212> PRT 
<213> Mycoplasma 



genitalium 



<400> 126 




Met 


Val 


Thr 


Glu 


1 








Ala 


Lys 


Lys 


Asn 








20 


CyB 


Lys 


Leu 


Thr 






35 




Asp 


Ala 


Gin 


Phe 




50 






Ala 


Ala 


Tyr 


Asn 


65 








Gly Val 


Glu 


Lys 


He 


Asp 


Leu 


Phe 








100 



He Arg Ser Leu 
5 

Val lie Val Asp 

Ser Pro Glu Phe 

40 

Val Lys Val Asn 
55 

He Thr Ser Leu 
70 

Lys Arg Ala lie 
85 

Asn Asn 



Lys Gin Leu Glu 
10 

Phe Trp Ala Ala 
25 

Gin Lys Ala Ala 

Val Asp Asp His 

60 

Pro Thr He Val 
75 

Gly Phe Met Pro 
90 



Glu He Phe Ser 
15 

Trp Cys Gly Pro 
30 

Asp Glu Phe Ser 
45 

Thr Asp He Ala 

Val Phe Glu Asn 

80 

Lys Thr Lys lie 
95 



<210> 127 
<211> 458 
<212> PRT 

<213> mycobacterium leprae 



<400> 127 
Met Asn Thr Thr 
1 

He Gly Ser Gly 

20 

Ala Gin Leu Thr 
35 

Leu Met Thr Thr 
50 

He Thr Gly Pro 
65 

Phe Gly Ala Glu 

Gly Pro He Lys 

100 

Arg Ala Val He 
115 

Pro Gly Glu Gin 
130 

Cys Asp Gly Ser 
145 

Gly Asp Ser Ala 

Ser Val Thr Leu 

180 

Met Leu Gly Arg 
195 

His Thr Val Val 
210 



Pro Ser Ala His 
5 

Pro Ala Gly Tyr 

Pro Leu Val Phe 

40 

Thr Glu Val Glu 
55 

Glu Leu Met Asp 
70 

Leu Arg Thr Glu 
85 

Ser Val Val Thr 

Leu Ala Met Gly 

12 0 

Glu Leu Leu Gly 
135 

Phe Phe Arg Gly 
150 

Met Glu Glu Ala 
165 

Val His Arg Arg 

Ala Arg Asn Asn 

200 

Ala Val Asn Gly 
215 



Glu Thr He His 
10 

Thr Ala Ala Leu 
25 

Glu Gly Thr Ser 

Asn Tyr Pro Gly 

60 

Asp Met Arg Glu 
75 

Asp Val Glu Ser 
90 

Ala Glu Gly Gin 
105 

Thr Ser Val Arg 

Arg Gly Val Ser 

140 

Gin Asp He Ala 
155 

Leu Phe Leu Thr 
170 

Asp Glu Phe Arg 
185 

Asp Lys He Lys 

Tyr Thr Thr Val 

220 



Glu Val He Val 
15 

Tyr Ala Ala Arg 
30 

Phe Gly Gly Ala 
45 

Phe Arg Asn Gly 

Gin Ala Leu Arg 

80 

Val Ser Leu Arg 
95 

Thr Tyr Gin Ala 
110 

Tyr Leu Gin lie 
125 

Ala Cys Ala Thr 

Val He Gly Gly 

160 

Arg Phe Ala Arg 
17 5 

Ala Ser Lys He 
190 

Phe He Thr Asn 
205 

Thr Gly Leu Arg 
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Leu Arg Asn Thr 

225 

' Val Phe Val Ala 

Val Val Asp He 

260 

Thr Ser Thr Ser 
275 

Arg Thr Tyr Arg 
290 

Ala He Asp Ala 
305 

Asn Glu Thr Thr 

Asp Trp Ser Thr 

340 

Glu Val Thr Asp 
355 

Pro Val Leu Val 
370 

Val Ala Pro Val 
385 

Thr Val Ala Lys 

Phe Gin Val Val 

420 

Pro Val Lys Arg 
435 

Asp Leu Ser Asp 
450 



Thr Thr Gly Glu 
230 

He Gly His Glu 
245 

Asp Pro Asp Gly 

Met Asp Gly Val 

280 

Gin Ala He Thr 
295 

Glu Arg Trp Leu 
310 

Glu Glu Thr Gly 
325 

Ala Met Thr Asp 

Ala Ser Phe Phe 

360 

Asp Phe Trp Ala 
375 

Leu Glu Glu He 
390 

Leu Asp Val Asp 
405 

Ser He Pro Thr 

He Val Gly Ala 

440 

Val Val Pro Asn 
455 



Glu Thr Thr Leu 

235 

Pro Arg Ser Ser 
250 

Tyr Val Leu Val 
265 

Phe Ala Ala Gly 

Ala Ala Gly Ser 

300 

Ala Glu His Ala 
315 

Asp Val Asp Ser 
330 

Ala Lys Asn Ala 
345 

Ala Asp Val Leu 

Thr Trp Cys Gly 

380 

Ala Ser Glu Gin 
395 

Thr Asn Pro Glu 
410 

Met He Leu Phe 
425 

Lys Gly Lys Ala 
Leu Asn 



Val Val Thr Gly 

240 

Leu Val Ser Asp 

255. 

Lys Gly Arg Thr 
270 

Asp Leu Val Asp 
285 

Gly Cys Ala Ala 

Gly Ser Lys Ala 

320 

Thr Asp Thr Thr 
335 

Gly Val Thr He 
350 

Ser Ser Asn Lys 
365 

Pro Cys Lys Met 

Arg Asn Gin Leu 

400 

Met Ala Arg Glu 
415 

Gin Gly Gly Gin 
430 

Ala Leu Leu Arg 
445 



<210> 128 
<211> 102 
<212> PRT 
<213> Mycoplasma 



pneumoniae 



<400> 128 
Met Val Thr Glu 
1 

Ser Asn Asn Lys 

20 

Cys Lys He Thr 
35 

Thr Val Ala Phe 
50 

Ala Ala Tyr Lys 
65 

Gly Gin Glu Lys 

Val Gin Leu Val 

100 



He Lys Ser Leu 
5 

Val He He Asp 

Gly Pro Glu Phe 

40 

Ala Lys Val Asn 
55 

He Thr Ser Leu 
70 

His Arg Ala He 
85 

Ser Gin 



Lys Gin Leu Gly 
10 

Phe Trp Ala Glu 
25 

Ala Lys Ala Ala 

Val Asp Glu Gin 

60 

Pro Thr He Val 
75 

Gly Phe Met Pro 
90 



Glu Leu Phe Ala 
15 

Trp Cys Gly Pro 
30 

Ser Glu Val Ser 
45 

Thr Asp He Ala 

Leu Phe Glu Lys 

80 

Lys Ala Lys He 
95 



<210> 129 
<211> 112 
<212> PRT 

<213> Mycobacterium smegmatis 

Met°Ser 2 Glu Asp Ser Ala Thr Val Ala Val Thr Asp Asp Ser Phe Ser 

1 5 10 15 t 

Thr Asp Val Leu Gly Ser Ser Lys Pro Val Leu Val Asp Phe Trp Ala 

20 25 30 

Thr Trp Cys Gly Pro Cys Lys Met Val Ala Pro Val Leu Glu Glu He 

35 A 40 45 

Ala Ala Glu Lys Gly Asp Gin Leu Thr Val Ala Lys He Asp Val Asp 
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50 

Val Asp Ala Asn Pro 
65 

Pro Thr Met lie Leu 

85 

Gly Ala Lys Gly Lys 

100 



55 

Ala Thr Ala Arg Asp Phe 
70 75 
Phe Lys Asp Gly Ala Pro 

90 

Ala Ala Leu Leu Arg Glu 

105 



60 

Gin Val Val Ser He 

80 

Val Lys Arg He Val 

95 

Leu Ser Asp Ala Leu 
110 



<210> 130 
<211> 115 
<212> PRT 

<213> Mycobacterium tuberculosis 



<400> 130 






















Phe 


Thr 


Asp 


Ser 


Glu 


Lys 


Ser 


Ala Thr He 


Lys 


Val 


Thr 


Asp 


Ala 


Ser 


1 






5 






10 










15 




Ala 


Thr 


Asp 


Val 


Leu 


Ser 


Ser Asn Lys 


Pro 


Val 


Leu 


Val 


Asp 


Phe 


Trp 






20 






25 










30 




Glu 


Ala 


Thr 


Trp 


Cys 


Gly 


Pro 


Cys Lys Met 


Val 


Ala 


Pro 


Val 


Leu 


Glu 






35 






40 








45 






Val 


He 


Ala 


Thr 


Glu 


Arg 


Ala 


Thr Asp Leu 


Thr 


Val 


Ala 


Lys 


Leu Asp 




50 










55 






60 






He 




Asp 


Thr 


Asn 


Pro 


Glu 


Thr Ala Arg Asn 


Phe 


Gin 


Val 


Val 


Ser 


Pro 


65 










70 






75 








Val 


80 


Thr 


Leu 


He 


Leu 


Phe 


Lys 


Asp Gly Gin 


Pro 


Val 


Lys 


Arg 


lie 


Gly 










B5 


90 










95 




Ala 


Lys 


Gly 


Lys 


Ala 


Ala 


Leu Leu Arg 


Glu 


Leu 


Ser 


Asp 


Val 


Val 


Pro 




100 






105 










110 






Asn 


Leu 


Asn 
115 

























<210> 131 
<211> 127 
<212> PRT 

<213> Neurospara crass a 



<400> 131 
























Ala 




Met 


Ser 


Asp 


Gly 


Val 


Lys 


His 


He 


Asn 


Ser 


Ala 


Gin 


Glu 


Phe 


Asn 


1 




5 










10 










15 




Leu 


Leu 


Asn 


Thr 


Thr 


Gin 


Tyr 


Val 


Val 


Ala 


Asp 


Phe 


Tyr 


Ala 


Asp 


Trp 








20 








25 










30 






Cys 


Gly 


Pro 


Cys 


Lys 


Ala 


He 


Ala 


Pro 


Met 


Tyr 


Ala 


Gin 


Phe 


Ala 


Lys 


35 








40 










45 




Val 




Thr 


Phe 


Ser 


He 


Pro 


Asn 


Phe 


Leu 


Ala 


Phe 


Ala 


Lys 


He 


Asn 


Asp 




50 










55 










60 








Thr 


Ser 


Val 


Gin 


Gin 


Val 


Ala 


Gin 


His 


Tyr Arg 


Val 


Ser 


Ala 


Met 


Pro 


65 










70 










75 










80 


Phe 


Leu 


Phe 


Phe 


Lys 


Asn 


Gly 


Lys 


Gin 


Val 


Ala 


Val 


Asn 


Gly 


Ser 


Val 










85 








90 










95 




Met 


He 


Gin 


Gly 


Ala 


Asp 


Val 


Asn 


Ser 


Leu 


Arg 


Ala 


Ala 


Ala 


Glu 


Lys 








100 








105 










110 






Met 


Gly 


Arg 


Leu 


Ala 


Lys 


Glu 


Lys 


Ala 


Ala 


Ala 


Ala 


Gly 


Ser 


Ser 






115 










120 










125 









<210> 132 
<211> 106 
<212> PRT 

<213> Penicillium chrysogenum 
<400> 132 

Met Gly Val Thr Pro He Lys Ser 

1 5 
Thr Asp Ala Thr Gly Pro Val Val 

20 



Val Ala Glu Tyr Lys Glu Lys Val 

10 15 
Val Asp Phe His Ala Thr Trp Cys 
25 ~ 30 
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Gly Pro Cys Lys 
35 

His Thr Gly He 
50 

Val Ala Ala Ser 
65 

Lys Gly Gly Glu 

He Gin Ala Gly 

100 



Ala He Ala Pro 

40 

Gin Phe Tyr Lys 
55 

Asn Gly Val Ser 

70 ' 

Arg Asn Glu Glu 
85 

Val Lys Ala He 



Ala Leu Glu Lys 

Val Asp Val Asp 

60 

Ala Met Pro Thr 
75 

Val Lys Gly Ala 
90 

Leu Glu 
105 



Leu Ser Glu Thr 
45 

Glu Leu Ser Glu 

Phe His Phe Tyr 

80 

Asn Pro Ala Ala 
95 



<210> 133 
<211> 108 
<212> PRT 

<213> Pseudomonas aeruginosa 



<400> 133 




Met 


Ser 


Glu 


His 


1 








Val 


Leu 


Lys 


Ala 








20 


Cys 


Gly 


Pro 


Cys 






35 




Asp 


Tyr 


Gin 


Gly 




50 






Gin 


Asp 


Thr 


Pro 


65 








Leu 


Phe 


Lys 


Asp 


Lys 


Ser 


Gin 


Leu 






100 



He 


Val 


Asn 


Val 


5 








Asp 


Gly 


Pro 


Val 


Lys 


Met 


He 


Ala 








40 


Lys 


Leu 


Lys 


Val 




55 




Pro 


Lys 


Tyr 


Gly 




70 






Gly 


Asn 


Val 


Glu 


85 








Ala 


Ala 


Phe 


Leu 



Thr Asp Ala Ser 
10 

Leu Val Asp Tyr 
25 

Pro Val Leu Asp 

Cys Lys Leu Asn 

60 

Val Arg Gly He 
75 

Ala Thr Lys Val 
90 

Asp Ala Asn He 
105 



Phe Glu Gin Asp 
15 

Trp Ala Glu Trp 
30 

Glu Val Ala Arg 
45 

He Asp Glu Asn 

Pro Thr Leu Met 

80 

Gly Ala Leu Ser 
95 



<210> 134 
<211> 104 
<212> PRT 

<213> Rhodospir ilium rub rum 
<220> 

<221> VARIANT 
<222> 21, 35 

<223> Xaa = Any Amino Acid 



<400> 134 




Met 


Lys 


Gin 


Val 


1 








Asp 


Gly 


Pro 


Asn 








20 


Arg 


Gin 


Xaa 


Ala 






35 




Lys 


Val 


Thr 


Val 


50 






Ser 


Lys 


Tyr 


Gly 


65 








Gly 


Gin 


Val 


Ala 


Phe 


Glu 


Trp 


Val 








100 



Ser Asp Ala Ser 
5 

Xaa Val Asp Phe 

Pro Ala Leu Glu 

40 

Ala Lys He Asn 
55 

Val Arg Gly He 
70 

Ala Thr Lys He 
85 

Glu Ala Ser Val 



Phe Glu Glu Asp 
10 

Trp Ala Glu Trp 
25 

Glu Leu Ala Thr 

He Asp Glu Asn 

60 

Pro Thr Leu Met 
75 

Gly Ala Leu Pro 
90 



Val Leu Lys Ala 
15 

Cys Gly Pro Cys 
30 

Ala Leu Gly Asp 
45 

Pro Gin Thr Pro 

He Phe Lys Asp 

80 

Lys Thr Lys Leu 
95 



<210> 135 
<211> 105 
<212> PRT 

<213> Rhodobacter sphaeroides 
<400> 135 
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Ser 


Thr 


Val 


Pro 


1 








Ser 


Asp 


Val 


Pro 








20 


Cys 


Arg 


Gin 


He 






35 




Gly 


Lys 


Val 


Lys 




50 






Pro 


Ala 


Met 


Leu 


65 








Asn Gly 


Gin 


Val 


Leu 


Ala 


Thr 


Trp 








100 



Val Thr Asp Ala 
5 

Val Val Val Asp 

Gly Pro Ala Leu 

40 

He Val Lys Val 
55 

Gly Val Arg Gly 
70 

Val Ser Asn Lys 
85 

He Ala Ser Ala 



Thr Phe Asp Thr 
10 

Phe Trp Ala • Glu 
25 

Glu Glu Leu Ser 

Asn Val Asp Glu 

60 

He Pro Ala Leu 
75 

Val Gly Ala Ala 
90 

Leu 
105 



Glu Val Arg Lys 
15 

Trp Cys Gly Pro 
30 

Lys Glu Tyr Ala 
45 

Asn Pro Glu Ser 

Phe Leu Phe Lys 

80 

Pro Lys Ala Ala 
95 



<210> 136 
<211> 130 
<212> PRT 

<213> Rickettsia prowazekii 



<400> 136 




Met 


Ser 


Cys 


Tyr 


1 








Asp 


He 


Asn 


Thr 






20 


Ser 


Ser 


Phe 


Lys 






35 




Asp 


Phe 


Trp 


Ala 




50 






He 


Asp 


Glu 


He 


65 








Met 


Asn 


He 


Asp 


Ser 


He 


Pro 


Thr 








100 


Lys 


He 


Gly 


Leu 






115 




Ser 


He 








130 







Asn Glu He Thr 
5 

Thr Gin Arg He 

Asn Glu Val Leu 

40 

Glu Trp Cys Gly 
55 

Ser Lys Glu Leu 
70 

Glu Asn Pro Lys 
85 

He Met Leu Phe 

Gin Gin Lys Asn 

120 



Thr Leu Leu Glu 
10 

Asn Met Val Asn 
25 

Glu Ser Asp Leu 

Pro Cys Lys Met 

60 

Gin Asp Lys Val 
75 

Thr Pro Ser Glu 
90 

Lys Asn Gly Glu 
105 

Ser Leu Leu Asp 



Phe Asp Ser Asn 

15 . 

Asn Val Thr Asp 
30 

Pro Val Met Val 
45 

Leu He Pro He 

Lys Val Leu Lys 

80 

Tyr Gly He Arg 
95 

Gin Lys Asp Thr 
110 

Trp He Asn Lys 
125 



<210> 137 
<211> 106 
<212> PRT 

<213> Streptomyces aureofaciens 

Gly°Ala 3 Thr Val Lys Val Thr Asn Ala Thr Phe Lys Ser Asp Val Leu 

1 5 10 15 

Glu Ser Asp Lys Pro Val Leu Val His Phe Glu Gly Pro Trp Cys Gly 

20 25 30 

Pro Cys Lys Met Val Ala Pro Val Leu Asp Glu He Ala Asn Glu Tyr 

35 40 45 

Glu Gly Lys Val Lys Val Ala Lys Val Asn Thr Asp Glu Asn Pro Gin 

50 55 60 

Leu Ala Ser Gin Tyr Gly Val Arg Ser He Pro Thr Arg Leu Met Phe 
65 ** 70 75 8 0 

Lys Gly Gly Glu Val Ala Ala Asn Met Val Gly Ala Ala Pro Lys Thr 

85 90 95 

Arg Leu Ala Ala Phe Leu Asp Ala Ser Leu 

10 0 105 



<210> 138 
<211> HO 
<212> PRT 
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<213> Streptomyces coelicolor 



50 

Pro Gl- 
65 



a 

Gly 


Thx 


Leu 


Lys 


His 


Val 


Thr 


Asp 


Asp 


Ser 


Phe 


Glu 


Gin 


Asp 




5 








10 










15 




Lys 


Asn 


Asp 


Lys 


Pro 


Val 


Leu 


Val 


Asp 


Phe 


Trp 


Ala 


Ala 


Trp 


20 






25 










30 




Ala 


Pro 


Cys 


Arg 


Gin 


He 


Ala 


Pro 


Ser 


Leu 


Glu 


Ala 


He 


Ala 


35 






40 










45 




Glu 


Asn 


Gly 


Asp 


Lys 


lie 


Glu 


He 


Val 


Lys 


Leu 


Asn 


He 


Asp 




55 










60 










Thr 


Ala 


Ala 


Lys 


Tyr 


Gly 


Val 


Met 


Ser 


He 


Pro 


Thr 


Leu 


Asn 








70 








75 










80 


Gin 


Gly 


Gly 


Glu 


Val 


Ala 


Lys 


Thr 


He 


Val 


Gly 


Ala 


Lys 


Pro 




85 










90 










95 




Ala 


lie 


Val 


Arg 


Asp 


Leu 


Glu 


Asp 


Phe 


He 


Ala 


Asp 








100 






105 










110 







<210> 139 
<211> 107 
<212> PRT 

<213> Streptomyces clavuligerus 

<400> 139 , _ _ 

Met Ala Gly Val Leu Lys Asn Val Thr Asp Asp Thr Phe Glu Ala Asp 

15 10 15 

Val Leu Lys Ser Glu Lys Pro Val Leu Val Asp Phe Trp Ala Glu Trp 

20 25 30 

Cys Gly Pro Cys Arg Gin He Ala Pro Ser Leu Glu Ala He Thr Glu 

35 * " 40 45 

His Gly Gly Gin He Glu He Val Lys Leu Asn He Asp Gin Asn Pro 

50 .55 60 
Ala Thr Ala Ala Lys Tyr Gly Val Met Ser He Pro Thr Leu Asn Val 
65 70 75 80 

Tvr Gin Gly Gly Glu Val Val Lys Thr He Val Gly Ala Lys Pro Lys 

* 85 90 95 

Ala Ala Leu Leu Arg Pro Gly Pro Val Pro Arg 

100 105 

<210> 140 
<211> 106 
<212> PRT 

<213> Synechocystis 

<400> 140 „ tt i t 

Ser Ala Thr Pro Gin Val Ser Asp Ala Ser Phe Lys Glu Asp Val Leu 

1 5 10 15 

Asp Ser Glu Leu Pro Val Leu Val Asp Phe Trp Ala Pro Trp Cys Gly 

20 25 30 

Pro Cys Arg Met Val Ala Pro Val Val Asp Glu He Ser Gin Gin Tyr 

35 40 45 

Glu Gly Lys Val Lys Val Val Lys Leu Asn Thr Asp Glu Asn Pro Asn 

50 " 55 60 

Thr Ala Ser Gin Tyr Gly He Arg Ser He Pro Thr Leu Met He Phe 
65 70 75 80 

Lys Gly Gly Gin Arg Val Asp Met Val Val Gly Ala Val Pro Lys Thr 

85 90 95 

Thr Leu Ala Ser Thr Leu Glu Lys Tyr Leu 

100 105 

<210> 141 
<211> 109 
<212> PRT 

<213> Synechocystis 
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<400> 141 




Met 


Ser 


Leu 


Leu 


1 








Gly 


Gin 


Thr 


Lys 








20 


Pro 


Cys 


Arg 


Leu 






35 




Gly Asp 


Lys 


Leu 




50 






Ala 


Val 


Ala 


Gin 


65 








Lys 


Asn 


Asn 


Glu 


Lys 


Leu 


Leu 


Glu 






100 



Glu He Thr Asp 
5 

Pro Val Leu Val 

Met Ala Pro Ala 

40 

Lys Val Leu Lys 
55 

Cys Lys Val Glu 
70 

Leu Val Met Thr 
85 

Leu Leu Lys Glu 



Ala Glu Phe Glu 

10 

Tyr Phe Trp Ala 
25 

He Gin Ala He 

Leu Glu Val Asp 

SO 

Gly Val Pro Ala 
7 5 

His Glu Gly Ala 
90 

Glu Leu Asp Phe 
105 



Gin Glu Thr Gin. 
15 

Ser Trp Cys Gly 
30 

Ala Lys Asp Tyr 
45 

Pro Asn Pro Ala 

Leu Arg Leu Phe 

80 

He Ala Lys Pro 
95 

He 



<210> 142 
<211> 108 
<212> PRT 

<213> Thiobacillus ferrooxidans 



65 



> 142 




















Phe 


Glu 


Thr 
15 


Asp 


Ser 


Asp 


Ala 


He 
5 


Leu 


Tyr 


Val 


Ser 


Asp 
10 


Asp 


Ser 


Leu 


Lys 


Ser 


Ser 


Lys 


Pro 


Val 


Leu 


Val 


Asp 


Phe 


Trp 


Ala 


Glu 


Trp 




20 








25 










30 


Ala 




Gly 


Pro 


Cys 


i«ys 


Met 


He 


Ala 


Pro 


He 


Leu 


Glu 


Glu 


He 


ASp 


35 








40 










45 




Glu 


Asn 


Tyr 


Ala 


Asp 


Axg 


Leu 


Arg 


Val 


Ala 


Lys 


Phe 


Asn 


He 


Asp 


50 








55 










60 




Thr 


Leu 


Leu 


Asn 


Thr 


Pro 


Pro 


Gin 


Tyr 


Ala 


He 


Arg 


Gly 


He 


Pro 








70 








75 










80 


Phe 


Lys 


Ala 


Gly 
85 


Lys 


Leu 


Glu 


Ala 


Thr 
90 


Lys 


Val 


Gly 


Ala 


Leu 
95 


Ser 


Ala 


Gin 


Leu 
100 


Thr 


Ala 


Phe 


Leu 


Asp 
105 


Ser 


Gin 


Leu 











<210> 143 
<211> 91 
<212> PRT 

<213> Thiocapsa roseopersicina 

Met°Ser 4 Asp Ser He Val His Val Thr Asp Asp Ser Phe Glu Asp Glu 

15 10 15 

Val Leu Lys Ser Leu Glu Pro Val Leu Val Asp Tyr Trp Ala Asp Trp 

20 25 30 

Cys Gly Pro Cys Lys Met He Ala Pro Val Leu Asp Glu He Ala Gly 

35 40 45 

Glu Tyr Ala Gly Arg He Lys Val Ala Lys Leu Asn He Asp Glu Asn 

50 55 60 

Pro Asn Thr Pro Arg Arg Tyr Gly He Arg Gly He Pro Thr Leu Met 
65 70 75 80 

Leu Ser Arg Gin Ser Glu Val Glu Ala Thr Lys 

85 90 



<210> 144 
<211> 44 
<212> PRT 

<213> Tissierella creatinophila 
<400> 144 

Met He Glu Leu Asp Lys Ser Asn Phe Glu Glu Glu Val Leu Lys Ala 
1 5 10 15 



-85- 



WO 02/50289 



PCTAJS01/50240 



Glu Gly Thr Val 

20 

Lys Ala Leu Met 
35 



Leu Val Asp Phe 

Pro His Val His 

40 



Trp Ser Pro Ser 

25 

Asp Phe Glu Glu 



Cys Glu Pro Cys 
3 0 



<210> 145 
<211> 105 
<212> PRT 
<213> Treponema 



pallidum 



<400> 145 




Met 


Ala 


Leu 


Leu 


1 








Thr 


Asn 


Pro 


Leu 








20 


Cys 


Lys 


Met 


Leu 




35 




Ser 


Gly 


Val 


Val 




50 






Ala 


Val 


Glu 


Phe 


65 








Asp 


Gly 


Lys 


Glu 


He 


Leu 


Thr 


Leu 








100 



Asp He Ser Ser 
5 

Val He Val Asp 

Gly Pro Val Leu 

40 

He Gly Lys Leu 
55 

Asn Val Ala Ser 
70 

Val Asp Arg Ser 
85 

lie Gin Lys Asn 



Gly Asn Val Arg 
10 

Phe Trp Ala Pro 
25 

Glu Glu Val Glu 

Asn Val Asp Asp 

60 

He Pro Thr Leu 
75 

He Gly Phe Val 
90 

Ala 
105 



Lys Thr He Glu 
15 

Trp Cys Gly Ser 
30 

Ser Glu Val Gly 
45 

Asp Gin Asp Leu 

He Val Phe Lys 

80 

Asp Lys Ser Lys 
95 



<210> 146 
<211> 104 
<212> PRT 
<213> Bos taurus 



<400> 146 
Val Lys Gin lie 
1 

Ala Gly Glu Lys 

20 

Pro Cys Lys Met 
35 

Ser Asn Val Val 
50 

Ala Ala Glu Cys 
65 

Lys Gly Gin Lys 

Glu Ala Thr He 

100 



Glu Ser Lys Tyr 
5 

Leu Val Val Val 

He Lys Pro Phe 

40 

Phe Leu Glu Val 
55 

Glu Val Lys Cys 
70 

Val Gly Glu Phe 
85 

Asn Glu Leu He 



Ala Phe Gin Glu 
10 

Asp Phe Ser Ala 
25 

Phe His Ser Leu 

Asp Val Asp Asp 

60 

Met Pro Thr Phe 
75 

Ser Gly Ala Asn 
90 



Ala Leu Asn Ser 
15 

Thr Trp Cys Gly 
30 

Ser Glu Lys Tyr 
45 

Cys Gin Asp Val 

Gin Phe Phe Lys 

80 

Lys Glu Lys Leu 

95 



<210> 147 
<211> 166 
<212> PRT 
<213> Bos taurus 



<400> 147 

Met Ala Gin Arg Leu 

1 5 
Gly Lys Pro Ser Gin 

20 

Lys Thr Pro Gin Tyr 
.35 

Ala Arg Ser He Tyr 
50 

Gin Asp Gly Pro Asp 
65 

Val Val Val Asp Phe 



Leu Leu Arg Arg Phe Leu 

10 

Ser Arg Trp Ala Pro Val 

25 

Ser Pro Gly Tyr Leu Thr 
40 

Thr Thr Arg Val Cys Ser 
55 

Phe Gin Asp Arg Val Val 
70 75 
His Ala Gin Trp Cys Gly 



Thr Ser He He Ser 

15 

Ala Ser Arg Ala Leu 
30 

Val Thr Pro Ser Gin 
45 

Thr Thr Phe Asn He 
60 

Asn Ser Glu Thr Pro 

80 

pro Cys Lys He Leu 
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Gly Pro Arg Leu 

100 

Met Ala Lys Val 
115 

Glu Val Ser Ala 
130 

Val Asp Lys Phe 
145 

Leu Lys Lys Leu 



85 

Glu Lys Val Val 

Asp lie Asp Asp 

120 

Val Pro Thr Val 
135 

Val Gly He Lys 

150 
He Gly 
165 



90 

Ala Lys Gin His 
105 

His Thr Asp Leu 

Leu Ala Met Lys 

140 

Asp Glu Asp Gin 
155 



95 

Gly Lys Val Val 
110 

Ala Leu Glu Tyr 
125 

Asn Gly Asp Val 

Leu Glu Ala Phe 

160 



<210> 148 
<211> 115 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 148 




Met 


Leu 


Lys 


Arg 


1 








Asp 


Phe 


Glu 


Gin 






20 


Asp 


Phe 


Tyr 


Ala 






35 




Tyr 


Lys 


Glu 


Leu 




50 






Asp 


Val 


Asp 


Glu 


65 








Met 


Pro 


Thr 


Phe 


Glu Gly 


Cys 


Val 








100 


Ser 


Ala 


Gin 








115 





Cys Asn Phe Lys 
5 

Leu He Arg Gin 

Thr Trp Cys Gly 

40 

Ala Thr Thr His 

55 

Ala Glu Asp Leu 
70 

He Phe Thr Lye 
85 

Glu Asp Glu Leu 



Asn Gin Val Lys 
10 

His Pro Glu Lys 
25 

Pro Cys Lys Ala 

Lys Gly lie He 

60 

Cys Ser Lys Tyr 
75 

Asn Gly Asp Ala 
90 

Arg Gin Lys Val 
105 



Tyr Phe Gin Ser 
15 

He He He Leu 
30 

He Ala Pro Leu 
45 

Phe Cys Lys Val 

Asp Val Lys Met 

80 

He Glu Ala Leu 
95 

Leu Glu His Val 
110 



<210> 149 
<211> 20 
<212> PRT 

<213> Canis familiaris 



Val Lys Gin He Glu Phe Lys Tyr Ala Phe Gin Glu Ala Leu Asn Ser 

1 5 10 15 

Ala Gly Asp Lys 

20 



<210> 150 
<211> 104 
<212> PRT 

<213> Gallus gallus 



<400> 150 
Val Lys Ser Val 
1 

Ala Gly Glu Lys 

20 

Pro Cys Lys Met 
35 

Gly Asp Val Val 
50 

Ala Thr His Cys 
65 

Asn Gly Lys Lys 



Gly Asn Leu Ala 
5 

Leu Val Val Val 

He Lys Pro Phe 

40 

Phe He Glu He 
55 

Asp Val Lys Cys 
70 

Val Gin Glu Phe 
85 



Asp 


Phe 


Glu 


Ala 




10 






Asp 


Phe 


Ser 


Ala 


25 








Phe 


His 


Ser 


Leu 


Asp 


Val 


Asp 


Asp 






60 


Met 


Pro 


Thr 


Phe 






75 




Ser 


Gly Ala 


Asn 




90 







Glu Leu Lys Ala 
15 

Thr Trp Cys Gly 
30 

Cys Asp Lys Phe 
45 

Ala Gin Asp Val 

Gin Phe Tyr Lys 

80 

Lys Glu Lys Leu 
95 
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Glu Glu Thr lie Lys Ser Leu Val 

100 

.* 

<210> 151 
<211> 107 
<212> PRT 

<213> Drosophila melanogaster 
<400> 151 



Met 


Ala 


Ser 


Val 


Arg 


Thr 


Met 


Asn 


Asp 


Tyr 


His 


Lys 


Arg 


He 


Glu 


Ala 


1 








5 










10 










15 




Ala 


Asp 


Asp 


Lys 


Leu 


lie 


Val 


Leu 


Asp 


Phe 


Tyr 


Ala 


Thr 


Trp 


Cys 


Gly 




20 










25 










30 






Pro 


Cys 


Lys 


Glu 


Met 


Glu 


Ser 


Thr 


Val 


Lys 


Ser 


Leu 


Ala 


Arg 


Lys 


Tyr 




35 










40 










45 








Ser 


Ser 


Lys 


Ala 


Val 


Val 


Leu 


Lys 


He 


Asp 


Val 


Asp 


Lys 


Phe 


Glu 


Glu 




50 








55 










60 










Leu 


Thr 


Glu 


Arg 


Tyr 


Lys 


Val 


Arg 


Ser 


Met 


Pro 


Thr 


Phe 


Val 


Phe 


Leu 


65 






70 










75 








His 


80 


Arg 


Gin 


Asn 


Arg 


Arg 


Leu 


Ala 


Ser 


Phe 


Ala 


Gly 


Ala 


Asp 


Glu 


Lys 






85 










90 










95 




Leu 


Thr 


Asn 


Met 
100 


Met 


Ala 


Lys 


Leu 


Val 
105 


Lys 


Ala 













<210> 152 

<211> 104 

<212> PRT 

<213> Homo sapien 



<400> 152 
Val Lys Gin He 
1 

Ala Gly Asp Lys 

20 

Pro Cys Lys Met 
35 

Ser Asn Val He 
50 

Ala Ser Glu Cys 
65 

Lys Gly Gin Lys 

Glu Ala Thr He 

100 



Glu Ser Lys Thr 
5 

Leu Val Val Val 

He Lys Pro Phe 

40 

Phe Leu Glu Val 
55 

Glu Val Lys Cys 
70 

Val Gly Glu Phe 
85 

Asn Glu Leu Val 



Ala Phe Gin Glu 
10 

Asp Phe Ser Ala 
25 

Phe His Ser Leu 

Asp Val Asp Asp 

60 

Met Pro Thr Phe 
75 

Ser Gly Ala Asn 
90 



Ala Leu Asp Ala 
15 

Thr Trp Cys Gly 
30 

Ser Glu Lys Tyr 
45 

Cys Gin Asp Val 

Gin Phe Phe Lys 

80 

Lys Glu Lys Leu 
95 



<210> 153 
<211> 166 
<212> PRT 
<213> Homo sapien 



<400> 153 



Met 


Ala 


Gin 


Arg 


Leu 


Leu 


Leu 


Arg 


Arg 


Phe 


Leu 


Ala 


Ser 


Val 


He 


Ser 


1 






5 










10 










15 




Arg 


Lys 


Pro 


Ser 


Gin 


Gly 


Gin 


Trp 


Pro 


Pro 


Leu 


Thr 


Ser 


Lys 


Ala 


Leu 




20 








25 










30 






Gin 


Thr 


Pro 


Gin 


Cys 


Ser 


Pro 


Gly 


Gly 


Leu 


Thr 


Val 


Thr 


Pro 


Asn 


Pro 






35 








40 










45 








Ala 


Arg 


Thr 


He 


Tyr 


Thr 


Thr 


Arg 


He 


Ser 


Leu 


Thr 


Thr 


Phe 


Asn 


He 




50 








55 










60 










Gin 


Asp 


Gly 


Pro 


Asp 


Phe 


Gin 


Asp 


Arg 


Val 


Val 


Asn 


Ser 


Glu 


Thr 


Pro 


65 






70 










75 








He 


80 


Val 


Val 


Val 


Asp 


Phe 


His 


Ala 


Gin 


Trp 


Cys 


Gly 


Pro 


Cys 


Lys 


Leu 








85 










90 










95 




Gly 


Pro 


Arg 


Leu 


Glu 


Lys 


Met 


Val 


Ala 


Lys 


Gin 


His 


Gly 


Lys 


Val 


Val 
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100 

Met Ala Lys Val 
115 

Glu Val Ser Ala 
130 

Val Asp Lys Phe 
145 

Leu Lys Lys Leu 



Asp lie Asp Asp 

120 

Val Pro Thr Val 
135 

Val Gly He Lys 

150 
He Gly 
165 



105 

His Thr Asp Leu 

Leu Ala Met Lys 

140 

Asp Glu Asp Gin 
155 



110 

Ala He Glu Tyr 
125 

Asn Gly Asp Val 

Leu Glu Ala Phe 

160 



<210> 154 
<211> 104 
<212> PRT 

<213> Macaca mulatta 



<400> 154 




Val 


Lys 


Gin 


He 


1 








Ala 


Gly 


Asp 


Lys 








20 


Pro 


Cys 


Lys 


Met 






35 




Ser 


Asn 


Val 


Val 




50 






Ala 


Ser 


Glu 


Cys 


65 








Lys 


Gly 


Gin 


Lys 


Glu 


Ala 


Thr 


He 








100 



Glu Ser Lys Ala 
5 

Leu Val Val Val 

He Lys Pro Phe 

40 

Phe Leu Glu Val 
55 

Glu Val Lys Cys 
70 

Val Gly Glu Phe 
85 

Asn Glu Leu Val 



Ala Phe Gin Glu 
10 

Asp Phe Ser Ala 
25 

Phe His Ser Leu 

Asp Val Asp Asp 

60 

Met Pro Thr Phe 
75 

Ser Gly Ala Asn 
90 



Ala Leu Asp Asp 
15 

Thr Trp Cys Gly 
30 

Ser Glu Lys Tyr 
45 

Cys Gin Asp Val 

Gin Phe Phe Lys 

80 

Lys Glu Lys Leu 
95 



<210> 155 
<211> 104 
<212> PRT 

<213> Mus mus cuius 



<400> 155 
Val Lys Leu He 
1 

Ala Gly Asp Lys 

20 

Pro Cys Lys Met 
35 

Ser Asn Val Val 
50 

Ala Ala Asp Cys 
65 

Lys Gly Gin Lys 

Glu Ala Ser He 

100 



Glu Ser Lys Glu 
5 

Leu Val Val Val 

He Lys Pro Phe 

40 

Phe Leu Glu Val 
55 

Glu Val Lys Cys 
70 

Val Gly Glu Phe 
85 

Thr Glu Tyr Ala 



Ala Phe Gin Glu 
10 

Asp Phe Ser Ala 
25 

Phe His Ser Leu 

Asp Val Asp Asp 

60 

Met Pro Thr Phe 
75 

Ser Gly Ala Asn 
90 



Ala Leu Ala Ala 
15 

Thr Trp Cys Gly 
30 

Cys Asp Lys Tyr 
45 

Cys Gin Asp Val 

Gin Phe Tyr Lys 

80 

Lys Glu Lys Leu 
95 



<210> 156 
<211> 166 
<212> PRT 

<213> Mus mus cuius 



<400> 156 

Met Ala Gin Arg Leu Leu Leu Gly Arg Phe Leu Thr Ser Val He Ser 

1 5 10 15 

Arq Lys Pro Pro Gin Gly Val Trp Ala Ser Leu Thr Ser Lys Thr Leu 

20 25 30 

Gin Thr Pro Gin Tyr Asn Ala Gly Gly Leu Thr Val Met Pro Ser Pro 
35 40 45 



-89- 



WO 02/50289 



PCT/US01/50240 



Ala Arg Thr Val 
50 

Gin Asp Gly Pro 
65 

Val Val Val Asp 

Gly Pro Arg Leu 

100 

Met Ala Lys Val 
115 

Glu Val Ser Ala 
13 0 

Val Asp Lys Phe 
145 

Leu Lys Lys Leu 



His Thr Thr Arg 
55 

Asp Phe Gin Asp 
70 

Phe His Ala Gin 
85 

Glu Lys Met Val 

Asp lie Asp Asp 

120 

Val Pro Thr Val 
135 

Val Gly lie Lys 

150 
lie Gly 
165 



Val Cys Leu Thr 

60 

Arg Val Val Asn 
75 

Trp Cys Gly Pro 
90 

Ala Lys Gin His 
105 

His Thr Asp Leu 

Leu Ala lie Lys 

140 

Asp Glu Asp Gin 
155 



Thr Phe Asn Val 

Ser Glu Thr Pro 

80 

Cys Lys lie Leu 
95 

Gly Lys Val Val 
110 

Ala lie Glu Tyr 

125 

Asn Gly Asp Val 

Leu Glu Ala Phe 

160 



<210> 157 
<211> 33 
<212> PRT 
<213> Sus scrofa 



<400> 157 

Val Lys Gin lie Glu Ser Lys Tyr 

1 5 
Ala Gly Glu Lys Leu Val Val Val 

20 

Pro 



Ala Phe Gin Glu Ala Leu Asn Ser 

10 15 
Asp Phe Ser Ala Thr Trp Cys Gly 
25 30 



<210> 158 
<211> 104 
<212> PRT 
<213> Oryctolagua 



cuni cuius 



<400> 158 




Val 


Lys 


Gin 


He 


1 








Ala 


Gly 


Asp 


Lys 








20 


Pro 


Cys 


Lys 


Met 






35 




Asn 


Asn 


Val 


Val 




50 






Ala 


Ala 


Glu 


Cys 


65 








Lys 


Gly 


Gin 


Lys 


Glu 


Ala 


Thr 


He 








100 



Glu Ser Lys Ser 
5 

Leu Val Val Val 

He Lys Pro Phe 

40 

Phe He Glu Val 
55 

Glu Val Lys Cys 
70 

Val Gly Glu Phe 
85 

Asn Glu Leu Leu 



Ala Phe Gin Glu 
10 

Asp Phe Ser Ala 

25 

Phe His Ala Leu 

Asp Val Asp Asp 

60 

Met Pro Thr Phe 
75 

Ser Gly Ala Asn 
90 



Val Leu Asp Ser 
15 

Thr Trp Cys Gly 
30 

Ser Glu Lys Phe 
45 

Cys Lys Asp He 

Gin Phe Phe Lys 

80 

Lys Glu Lys Leu 
95 



<210> 159 
<211> 104 
<212> PRT 

<213> Rattus norvegicus 



Val Lys Leu He 
1 

Ala Gly Asp Lys 

20 

Pro Cys Lys Met 
35 

Ser Asn Val Val 



Glu Ser Lys Glu 
5 

Leu Val Val Val 

He Lys Pro Phe 

40 

Phe Leu Glu Val 



Ala Phe Gin Glu 
10 

Asp Phe Ser Ala 
25 

Phe His Ser Leu 
Asp Val Asp Asp 



Ala Leu Ala Ala 
15 

Thr Trp Cys Gly 
30 

Cys Asp Lys Tyr 
45 

Cys Gin Asp Val 
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50 55 60 

Ala Ala Asp Cys Glu Val Lys Cys Met Pro Thr Phe Gin Phe Tyr Lys 
SS 70 75 80 

Lys Glv Gin Lys Val Gly Glu Phe Ser Gly Ala Asn Lys Glu Lys Leu 

85 90 95 

Glu Ala Thr lie Thr Glu Phe Ala 

100 



<210> 160 
<211> 166 
<212> PRT 

<213> Rattus norvegicus 



<400> 160 
Met Ala Gin Arg 
1 

Arg Lys Pro Pro 

20 

Gin Thr Pro Pro 
35 

Ala Arg Thr Phe 
50 

Gin Asp Gly Pro 
65 

Val Val Val Asp 

Gly Pro Arg Leu 

100 

Met Ala Lys Val 
115 

Glu Val Ser Ala 
130 

Val Asp Lys Phe 
145 

Leu Lys Lys Leu 



Leu Leu Leu Arg 

5 

Gin Gly Val Trp 

Tyr Asn Ala Gly 

40 

His Thr Thr Arg 
55 

Asp Phe Gin Asp 
70 

Phe His Ala Gin 
85 

Glu Lys Met Val 

Asp lie Asp Asp 

120 

Val Pro Thr Val 
135 

Val Gly lie Lys 

150 
lie Gly 
165 



Arg 


Phe 


Leu 


Thr 




10 






Ala 


Ser 


Leu 


Thr 


25 








Gly 


Leu 


Thr 


Gly 


Val 


Cys 


Ser 


Thr 








60 


Arg 


Val 


Val 


Asn 






75 




Trp 


Cys 


Gly 


Pro 




90 






Ala 


Lys 


Gin 


His 


105 








His 


Thr 


Asp 


Leu 


Leu 


Ala 


lie 


Lys 








140 


Asp 


Glu Asp 


Gin 






155 





Ser Val lie Ser 
15 

Ser Thr Ser Leu 
30 

Thr Pro Ser Pro 
45 

Thr Phe Asn Val 

Ser Glu Thr Pro 

80 

Cys Lys He Leu 
95 

Gly Lys Val Val 
110 

Ala He Glu Tyr 
125 

Asn Gly Asp Val 

Leu Glu Ala Phe 

160 



<210> 161 
<211> 104 
<212> PRT 
<213> Ovis ariea 



<400> 161 




Val 


Lys 


Gin 


He 


1 








Ala 


Gly 


Glu 


Lys 








20 


Pro 


Cys 


Lys 


Met 






35 




Ser 


Asn 


Val 


Val 




50 






Ala 


Ala 


Glu 


Cys 


65 








Lys 


Gly 


Gin 


Lys 


Glu 


Ala 


Thr 


He 








100 



Glu Ser Lys Tyr 
5 

Leu Val Val Val 

He Lys Pro Phe 

40 

Phe Leu Glu Val 
55 

Glu Val Lys Cys 
70 

Val Ser Glu Phe 
85 

Asn Glu Leu He 



Ala Phe Gin Glu 
10 

Asp Phe Ser Ala 
25 

Phe His Ser Leu 

Asp Val Asp Asp 

60 

Met Pro Thr Phe 
75 

Ser Gly Ala Asn 
90 



Ala Leu Asn Ser 
15 

Thr Trp Cys Gly 
30 

Ser Glu Lys Tyr 
45 

Cys Gin Asp Val 

Gin Phe Phe Lys 

80 

Lys Glu Lys Leu 
95 



<210> 162 
<211> 261 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 162 
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Met Ala Arg Leu 
1 

Asn Leu Ser Pro 

20 

Ala Pro Arg Phe 
35 

Glu Thr Lys Gin 
50 

Ala Ser Lys Glu 
65 

Ser Ser Ser Ser 

Arg Val Ala Ala 

100 

Thr Arg Leu Asp 
115 

Leu Pro Tyr Glu 
130 

Phe Tyr Ala Asp 
145 

Tyr Lys lie Glu 

Asn Val Asp Asn 

180 

Glu Gly He Pro 
195 

Gly Asn Val Val 
210 

Asn Ala Leu Ala 
225 

Gly Gin Tyr Ser 

Pro Leu Ser His 

260 



Val Phe Ser Leu 

5 

Arg Asn Leu Gin 

Arg Ala Val Arg 

40 

Glu Lys Leu Val 
55 

Val Glu Ser Ser 
70 

Gly Phe Pro Glu 
85 

Val Thr Val He 

Phe Gly He Ser 

120 

Glu Ala Leu Ser 
135 

Trp Cys Glu Val 
150 

Gin Gin Tyr Lys 
165 

Thr Lys Trp Glu 

His Phe Ala Phe 

200 

Gly Arg Leu Pro 
215 

Ala Gly Lys Gin 
230 

Ser Ser Glu Ser 

245 

Gly 



Asn Leu Pro Ser 

10 

Ser Phe Phe Val 
25 

Cys Lys Pro Asn 

He Asp Asn Gly 

60 

Ser Ser Val Ala 
75 

Ser Pro Asn Lys 
90 

Ala Ala Leu Ser 
105 

Leu Lys Asp Leu 

Asn Gly Lys Pro 

14 0 

Cys Arg Glu Leu 
155 

Asp Lys Val Asn 
170 

Gin Glu Leu Asp 
185 

Leu Asp Arg Glu 

Arg Gin Tyr Leu 

22 0 

Ser He Pro Tyr 
235 

Arg Lys Val His 
250 



Ser His Gly Phe 
15 

Thr Gin Thr Gly 
30 

Pro Glu Ser Ser 
45 

Glu Thr Ser Ser 

Asp Ser Ser Ser 

80 

Asp He Asn Arg 
95 

Leu Phe Val Ser 
110 

Thr Ala Ser Ala 
125 

Thr Val Val Glu 

Ala Pro Asp Val 

160 

Phe Val Met Leu 
175 

Glu Phe Gly Val 
190 

Gly Asn Glu Glu 
205 

Val Glu Asn Val 

Ala Arg Ala Val 

240 

Gin Val Thr Asp 
255 



<210> 163 
<211> 140 
<212> PRT 
<213> Arabidopsis 



thaliana 



<400> 163 
Met Gly Ser Cys 
1 

Asn Val Glu Phe 

20 

Ser Trp Asp Asp 
35 

Val Ala Asn Phe 
50 

Pro Phe Phe He 
65 

Leu Val Asp Val 

Lys Ala Thr Pro 

100 

Lys Leu Val Gly 
115 

He He Asp Ser 
130 



Val Ser Lys Gly 
5 

Ser Gly Gly Asn 

Lys Leu Ala Glu 

40 

Ser Ala Thr Trp 
55 

Glu Leu Ser Glu 
70 

Asp Glu Leu Ser 
85 

Thr Phe Phe Phe 

Ala Asn Lys Pro 

120 

Val Pro Glu Ser 
135 



Lys Gly Asp Asp 
10 

Val His Leu He 
25 

Ala Asp Arg Asp 

Cys Gly Pro Cys 

6 0 

Lys His Ser Ser 
75 

Asp Phe Ser Ser 
90 

Leu Lys Asn Gly 
105 

Glu Leu Gin Lys 

pro Gin Arg Pro 

14 0 



Asp Ser Val His 
15 

Thr Thr Lys Glu 
30 

Gly Lys lie Val 
45 

Lys He Val Ala 

Leu Met Phe .Leu 

80 

Ser Trp Asp He 
95 

Gin Gin He Gly 
110 

Lys Val Thr Ser 
125 



<210> 164 
<211> 186 
<212> PRT 
<213> Arabidopsis 



thaliana 
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<400> 164 
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J.U 


lie 


1 








Lys 


lie 
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Pro 








O BIT 




i I u 








-3 o 




iicU 


ax ti- 


Al a 






er n 






Gly 


Leu 


Gin 


Glu 


65 








x ie 




<j xy 




Leu 


Gly 


Glu 


Ser 








100 


Cys 


lie 


Tyr 


Leu 






115 




Pro 


Arg 


Leu 


Arg 




13 0 






Leu 


Val 


Ser 


Arg 


145 








Ala 


Glu 


Val 


lie 


Arg 


Glu 


Met 


lie 








180 



Val Asn Leu Ser 
5 

Leu Val Pro Pro 

Asn Phe Lys Tyr 

40 

Arg He Arg Ala 
55 

Leu Asp Asp Ser 
70 

Ser His Phe Asp 
85 

Val Val He Val 

Lys Pro Lys Leu 

120 

Phe Tyr His Val 
135 

Ala Gly Val Thr 
150 

Gly Gly His Lys 
165 

Glu Asn Asp Ser 



Ser Ser Leu Arg 
10 

Tyr Arg Gin Thr 
25 

His Ser Phe Thr 

Val Asp He Gin 

60 

Pro Val Ser Val 
75 

Gin Val Met Glu 
90 

Trp Met Ala Ala 
105 

Glu Lys Leu Ala 

Asp Val Asn Ala 

140 

Leu Trp Arg Asp 
155 

Ala His Phe Val 

17 0 
He Thr 
185 



Ser Leu Asn Pro 
15 

Ser Ser Ser Phe 
30 

Asp Lys He Cys 
45 

Lys Gin Asp Gly 

Glu Leu Gly Pro 

80 

Asp Ala Gin Lys 
95 

Trp Cys Arg Lys 
110 

Ala Glu Phe Tyr 
125 

Val Pro Tyr Arg 

Gly Gin Lys Gin 

160 

Val Asn Glu Val 
175 



<210> 165 
<2X1> 207 
<212> PRT 

<213> Arabidopsis thai i ana 



<400> 165 
Met Glu Asn Met 
1 

Asn Val His Lys 

20 

Ser His Val Leu 
35 

Arg Asn Lys Thr 
50 

Gly Lys Val His 
65 

Thr Glu Ala Asn 

Leu Cys Asn Met 

100 

Leu Pro Ser Lys 
115 

Tyr Thr Ser Met 
13 0 

Ser Lys Leu Ser 
145 

Ser His Glu Trp 

Asp Gly Arg Gin 

180 

Gin Lys Lys Thr 
195 



Ser Asn Leu Thr 
5 

His Cys Ala Val 

Lys Gin Val Cys 

40 

Gin Ala Arg Ser 
55 

Pro Val Ser Arg 
70 

Ser His Gly Lys 
85 

Glu Gin Leu Val 

Thr He Leu Pro 

120 

He Phe Val Thr 
135 

Asp Leu Gly Val 
150 

Asn Val Asp Ala 
165 

Met Asp Lys Leu 

Ala Ala Ala Ala 

200 



Ser Lys Phe Leu 
10 

Ser Asp Glu Asn 
25 

Ser Cys He Cys 

Gin Lys Gly Ser 

60 

Met Glu Lys Trp 

75 

He He Ala Arg 
90 

Val Asn Phe Lys 
105 

He Tyr Gin Glu 

He Asp Val Glu 

140 

Lys He Cys Leu 
155 

Thr Pro Thr Val 
170 

Val Gly Gly Asp 
185 

Asn Leu Leu Leu 



Leu Asn Pro Leu 
15 

Gly Asp Arg Lys 
30 

Cys Cys Asn Arg 
45 

Tyr Phe He Lys 

Glu Glu Lys He 

80 

His Asp Leu He 
95 

Ala Ser Trp Cys 
110 

Leu Ala Ser Thr 
125 

Glu Leu Ala He 

He Gin Glu Phe 

160 

Val Phe Leu Lys 
175 

Ala Ala Glu Leu 
190 

Arg Gin Ser 
205 



<210> 166 
<211> 175 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 166 
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Met 


Leu 


He 


Pro 


1 








Ser 


Ala 


Asn 


Pro 








20 


Ser 


Glu 


Leu 


Glu 






35 




Leu 


He 


Leu 


Tyr 




50 






Ser 


Pro 


Leu 


Tyr 


65 








Leu 


Lys 


Val 


Asp 


He 


Ser 


Ser 


Val 








100 


Asp 


Lys 


Val 


Val 






115 




Gin 


His 


Ser 


Ser 




130 






His 


Ser 


Asp 


Leu 


145 








Gin 


Lys 


Leu 


Lys 



His Ala Val Ser 
5 

Asp Gin Asn Arg 

Ala Lys Thr Lys 

40 

Phe Thr Ala Thr 
55 

Ser Asn Leu Ala 
70 

He Asp Lys Ala 
85 

Pro Thr Phe Cys 

Gly Ala Asp Lys 

120 

Ser Lys Ala Arg 
135 

Leu Leu Leu Val 
150 

Thr Lys Thr Gly 
16 5 



Phe Ala Phe Thr 
10 

Glu Val He Ser 
25 

Ala Ala Lys Lys 

Trp Cys Gly Pro 

60 

Thr Gin His Ser 
75 

Asn Asp Val Ala 
90 

Phe He Arg Asp 
105 

Gly Ser Leu Glu 

Tyr He Pro Val 

140 

Asn Glu Glu Thr 
155 

Asp Trp Phe His 
170 



Tyr Leu Arg Asn 
15 

He His Ser Thr 
30 

Ala Ser Arg Leu 
45 

Cys Arg Tyr Met 

Arg Val Val Phe 

80 

Ala Ser Trp Asn 
95 

Gly Lys Glu Val 
110 

Gin Lys He Ala 
12 5 

Phe He Lys Tyr 

Pro Thr Ser Asn 

160 

He Asn Leu 
175 



<210> 167 
<211> 132 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 167 



Met 


Arg 


Lys 


Gin 


Glu 


1 








5 


Ser 


Asn 


Asp 


Asn 
20 


Gly 


Leu 


Thr 


He 
35 


Leu 


Asp 


Asn 


Tyr 
50 


Gly 


Ala 


Ser 


Leu 


Phe 


Met 


Leu 


He 


65 










Phe 


Val 


Leu 


Lys 


Leu 
85 


His 


He 


Arg 


Tyr 
100 


Thr 


Val 


Asp 


Glu 
115 


Met 


Phe 


Trp 


Leu 
130 


His 


Ser 





Ser Glu Gly Ala Asn Leu 

10 

Asn Val Lys He Ala Pro 

25 

Asp He Lys Ser Ser Lys 
40 

Trp Tyr Thr Leu Phe Ser 
55 

Lys Cys Ser Met Lys Cys 
70 75 
Ser Asp He Asp Glu Cys 

90 

Pro Thr Phe Gin Phe Tyr 

105 

Gly Ala Gly Glu Gin Arg 
12 0 



Glu Phe Glu Ser Lys 

15 

Asn Asp Gin Ser Phe 
30 

Ser Pro Ala Val He 
45 

Val Phe Thr He Thr 
60 

Leu Asn Glu Asn Gly 

80 

Pro Glu Thr Thr Arg 

95 

Arg Asp Gly Glu Lys 
110 

Leu His Asp Arg Leu 
125 



<210> 168 
<211> 151 
<212> PRT 

<213> Arabidopsis thaliana 

Met°Ala 6 Ser He Ser Leu Ser Ser Ser Thr Val Pro Ser Leu Asn Ser 

1 5 10 15 

Lys Glu Ser Ser Gly Val Ser Ala Phe Ala Ser Arg Ser He Ser Ala 

20 25 30 ^ rr<u 

Val Lys Phe Gin Phe Pro Val Arg Arg He Glu Ala Lys Lys Gin Thr 

35 40 45 

Phe Asp Ser Phe Glu Asp Leu Leu Val Asn Ser Asp Lys Pro Val Leu 

50 55 60 

Val Asp Tyr Tyr Ala Thr Trp Cys Gly Pro Cys Gin Phe Met Val Pro 
65 " 70 75 80 

He Leu Asn Glu Val Ser Glu Thr Leu Lys Asp Lys He Gin Val Val 
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Lys lie Asp Thr 

10 0 

Glu Ala Leu Pro 
115 

Arg Phe Glu Gly 
13 0 

Asp Ser Leu Lys 
145 



85 

Glu Lys Tyr Pro 

Thr Phe lie Leu 

120 

Ala Leu Thr Ala 
135 

Val Lys Pro 
150 



90 

Ser lie Ala Asn 
105 

Phe Lys Asp Gly 

Lys Gin Leu lie 

140 



95 

Lys Tyr Lys lie 
110 

Glu Pro Cys Asp 
125 

Gin Arg lie Glu 



<210> 169 
<211> 236 
<212> PRT 
<213>- Arabidopsis 



thaliana 



<400> 169 
Met Ala Gly Val 
1 

Val Ser Ser Ser 

20 

Pro Cys Leu Pro 
35 

Leu Leu Ser Ser 
50 

Ser Gly Leu Gly 
65 

Val Arg Val Lys 

Trp Trp Glu Arg 

100 

Glu Phe Leu Ser 
115 

Glu Phe Tyr Gly 
130 

Leu Cys Lys Thr 
145 

Asn Phe Asp Glu 

Leu Pro Phe Phe 

180 

Phe Ser Cys Ser 
195 

Phe Pro Gin Leu 
210 

Ser Leu Phe Phe 
225 



Val Arg Leu Thr 

5 

Phe Ser Ser Phe 

Pro Asn Ser Asn 

40 

Ser Pro Ser Cys 
55 

Ser His Leu Pro 
70 

Val Asp Glu Asn 
85 

Asn Ala Pro Asn 

Ala Leu Ser Gly 

120 

Thr Trp Cys Ala 
135 

Ala Val Glu His 
150 

Asn Lys Pro Met 
165 

His Phe Tyr Arg 

Leu Ala Lys Val 

200 

Glu Leu Gly He 
215 

Phe Asp Arg He 
230 



Thr Thr Ser Val 
10 

Ala Thr Ala Leu 
25 

Leu Asn Ser Asp 

Ser Ser Ser His 

60 

Leu Arg Arg Pro 
75 

Val Ala Glu Thr 
90 

Met Val Asp He 
105 

Ala Gly Glu Arg 

Ser Cys Arg Ala 

140 

Pro Asp He Val 
155 

Cys Lys Ser Leu 
170 

Gly Ala Asp Gly 
185 

Lys Lys Ala He 

Thr Leu Gin Thr 

220 

Tyr Gin He Leu 
235 



Gin Ala He Arg 
15 

Asn Pro Leu Gin 
30 

Lys Arg Leu Arg 
45 

Tyr His Pro Ser 

Lys Ser Gin Val 

80 

Glu Pro Pro Lys 
95 

His Ser Thr Glu 
110 

Leu Val He Val 
125 

Leu Phe Pro Lys 

Phe Leu Lys Val 

160 

Asn Val Arg Val 
175 

Gin Leu Glu Ser 
190 

Ser Val Ser Pro 
205 

Lys Arg Thr Thr 



<210> 170 
<211> 131 
<212> PRT 
<213> Hordeum 



bulbosum 



<400> 170 
Met Gly Gly Cys 
1 

Asp Phe Lys Gly 

20 

Asp Gin Lys Val 
35 

Asn Phe Ser Ala 
50 

Tyr Ala Glu Met 
65 

Asp Val Asp Asp 



Val Gly Lys Asp 
5 

Gly Asn Val His 

Ala Glu Ala Asn 

40 

Ser Trp Cys Gly 
55 

Ser Lys Thr Tyr 
70 

Leu Met Asp Phe 
85 



Arg Ser He Val 
10 

Val He Thr Thr 
25 

Lys Asp Gly Lys 

Pro Cys Arg Val 

60 

Pro Gin Leu Met 
75 

Gly Ser Thr Trp 
90 



Glu Asp Lys Leu 
15 

Lys Glu Asp Trp 
30 

He Val Val Ala 
45 

He Ala Pro Val 

Phe Leu Thr He 

80 

Asp He Arg Ala 
95 
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Thr Pro Thr Phe 

100 

Val Gly Ala Asn 
115 

Asp Gly Ser 
13 0 



Phe Phe Leu Lys 

Lys Pro Glu Leu 

120 



Asn Gly Gin Gin 
105 

Glu Lys Lys Val 



lie Asp Lys Leu 
110 

Gin Ala Leu Gly 
125 



<210> 171 
<211> 131 
<212> PRT 

<213> Lolium perenne 



<400> 171 




Met 


Gly 


Gly 


Cys 


1 








Asp 


Phe 


Lys 


Gly 








20 


Asp 


Gin 


Lys 


Val 






35 




Asn 


Phe 


Ser 


Ala 




50 






Tyr 


Ala 


Glu 


Met 


65 




Asp 




Asp 


Val 


Asp 


Thr 


Pro 


Thr 


Phe 








100 


Val 


Gly 


Ala 


Asn 




115 




Asp 


Gly 


Ser 






130 







Val Gly Lys Asp 
5 

Gly Asn Val His 

Ala Glu Ala Asn 

40 

Ser Trp Cys Gly 
55 

Ser Lys Thr Tyr 
70 

Leu Met Asp Phe 
85 

Phe Phe Leu Lys 

Arg Pro Glu Leu 

120 



Arg Ser lie Val 
10 

Val He Thr Thr 
25 

Lys Asp Gly Lys 

Pro Cys Arg Val 

60 

Pro Gin Leu Met 
75 

Ser Ser Thr Trp 
90 

Asn Gly Gin Leu 
105 

Glu Lys Lys Val 



Glu Asp Lys Leu 
15 

Lys Glu Asp Trp 
30 

lie Val Val Ala 
45 

He Ala Pro Val 

Phe Leu Thr He 

80 

Asp He Arg Ala 
95 

He Asp Lys Leu 
110 

Gin Ala He Gly 
125 



<210> 172 
<211> 131 
<212> PRT 
<213> Oryza sativa 



<400> 172 




Met 


Gly 


Ser 


Cys 


1 








Asp 


Phe 


Lys 


Gly 








20 


Asp 


His 


Lys 


He 






35 




Asn 


Phe 


Ser 


Ala 




50 






Tyr 


Ala 


Glu 


Met 


65 








Asp 


Val 


Asp 


Glu 


Thr 


Pro 


Thr 


Phe 








100 


Val 


Gly 


Ala 


Asn 






115 




Asp 


Ser 


Ala 






130 







Val Gly Lys Glu 
5 

Gly Asn Val His 

Ala Glu Ala Asn 

40 

Ala Trp Cys Gly 
55 

Ser Gin Thr Tyr 
70 

Leu Met Asp Phe 
85 

Phe Phe Leu Lys 

Lys Pro Glu Leu 

120 



Arg Ser Asp Glu 
10 

Val He Ser Asn 
25 

Lys Asp Gly Lys 

Pro Cys Arg Val 

60 

Pro Gin Phe Met 

75' 

Ser Ser Ser Trp 
90 

Asn Gly Glu Gin 
105 

Glu Lys Lys Val 



Glu Asp Lys He 
15 

Lys Glu Asn Trp 
30 

He Val He Ala 
45 

He Ala Pro Val 

Phe Leu Thr He 

80 

Asp He Arg Ala 
95 

Val Asp Lys Leu 
110 

Ala Ala Leu Ala 
125 



<210> 173 
<211> 296 
^212> PRT 

<213 > Solarium tuberosum 



<400> 173 

Met Ala Thr Leu Thr Asn Phe Leu Leu Lys Pro Ser Pro Asn Leu Ala 
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1 








Ser 


lie 


Thr 


Lys 








20 


Lys 


Ser 


Lys 


Gin 






35 




Leu 


lie 


Thr 


Lys 




50 






Lys 


Ser 


Glu 


Arg 


65 








Ala 


Leu 


Lys 


Lys 


Lys 


Asp 


Ser 


Glu 








100 


Ser 


Lys 


Thr 


Cys 






115 




Thr 


Glu 


Lys 


Thr 




130 






Pro 


Hie 


Phe 


Asn 


145 








Gly 


He 


Gly 


Pro 


Ser 


His 


Ser 


Glu 








180 


Val 


He 


Gin 


Asp 






195 




Gly 


Leu 


Lys 


His 




210 






Lys 


Leu 


Ser 


Lys 


225 








Gly 


Asp 


Glu 


Asn 


lie 


Glu 


Val 


Pro 








260 


Arg 


Tyr 


Val 


Gly 






275 




Tyr 


Gin 


Gly 


Val 




290 







5 

He Ser Pro Ser 

Ser He Phe Lys 

40 

Ala Thr Ala Ala 
55 

Val Gin Lys Val 
70 

Ala Lys Asn Arg 
85 

Arg Ser Lys Asn 

Asn Asp Val Asp 

120 

Lys Ala Leu Cys 
135 

Phe Tyr Lys Ser 
150 

Asp Leu Leu Ala 
165 

Val Val Gin Leu 

His Lys He Asp 

200 

Cys Gly Pro Cys 
215 

Gin Met Ala Asp 
230 

Asp Ser Cys Met 
245 

Thr Phe Leu Phe 

Ser Gly Lys Gly 

280 

Arg Val Thr Tyr 
295 



10 

Leu Tyr Ser Asn 
25 

Asn Leu Lys Thr 

Pro Asp Val Glu 

60 

Asn Ser Met Glu 
75 

Leu Val Val Val 
90 

lie Tyr Pro Phe 
105 

Phe Leu Leu Val 

Arg Arg Glu Lys 

140 

Met Glu Lys He 
155 

Gly Asp Val Leu 
170 

His Ser Arg Glu 
185 

Lys Lys Leu lie 

Val Lys Val Tyr 

220 

Thr Val Val Phe 
235 

Gin Phe Leu Lys 
250 

He Arg Asp Gly 
265 

Glu Leu He Gly 



15 

Phe Pro Phe Glu 
30 

Asn Lys Pro Leu 
45 

Lys Lys Val Ala 

Glu Leu Asp Glu 

80 

Glu Phe Ala Gly 
95 

Met Val Asn Leu 
110 

He Gly Asp Glu 
125 

He Asp Lys Val 

His Glu Glu Glu 

160 

Tyr Tyr Gly Asp 
175 

Asp Val Glu Lys 
190 

Val Leu Asp Val 
205 

Pro Thr Val He 

Ala Arg Met Asn 

240 

Asp Met Asp Val 
255 

Glu He Cys Gly 
270 

Glu He Leu Arg 
285 



<210> 174 
<211> 131 
<212> PRT 

<213> Secale cereale 



<400> 174 




Met 


Gly 


Gly 


Cys 


1 








Asp 


Phe 


Lys 


Gly 








20 


Asp 


Gin 


Lys 


He 






35 




Asn 


Phe 


Ser 


Ala 




50 






Tyr 


Ala 


Gly 


Met 


65 








Asp 


Val 


Asp 


Asp 


Thr 


Pro 


Thr 


Phe 








100 


Val 


Gly 


Ala 


Asn 






115 




Asp Gly 


Ser 






130 







Val Gly Lys Gly 
5 

Gly Asn Val His 

Glu Glu Ala Asn 

40 

Ser Trp Cys Gly 

55 

Ser Lys Thr Tyr 
70 

Leu Met Asp Phe 
85 

Phe Phe Leu Lys 

Lys Pro Glu Leu 

120 



Arg Ser He Val 
10 

Val He Thr Thr 
25 

Lys Asp Gly Lys 

Pro Cys Arg Val 

60 

Pro Gin Leu Met 
75 

Ser Ser Thr Trp 
90 

Asn Gly Gin Gin 
105 

Glu Lys Lys Val 



Glu Glu Lys Leu 
15 

Lys Glu Asp Trp 
3 0 

He Val Val Ala 
45 

Val Ala Pro Val 

Phe Leu Thr He 

80 

Asp He Arg Ala 
95 

He Asp LyB Leu 
110 

Gin Ala Leu Gly 
125 



<210> 175 
<211> 119 
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<212> PRT 

<213> Secale cereale 
<400> 175 



Met 


Glv 


Gly 


Cys 


Val 


Gly 
-* 


Lys 


Gly Arg 


Ser 


lie 


Val 


Glu 


Glu 


Lys 


Leu 


1 




5 








10 










15 




Asp 


Phe 


Lys 


Gly 


Gly 


Asn 


val 


His Val 


lie 


Thr 


Thr 


Lys 


Glu 


Asp 


Trp 




20 








25 










30 






Asp 


Gin 


Lys 


lie 


Glu 


Glu 


Ala 


Asn Lys 


Asp 


Gly 


Lys 


lie 


Val 


Val 


Ala 




35 










40 








45 








Asn 


Phe 


Ser 


Ala 


Ser 


Trp 


Cys 


Gly Pro- 


Cys 


Arg 


Val 


lie 


Ala 


Pro 


Val 




50 










55 








60 








He 


Tyr 


Ala 


Glu 


Met 


Ser 


Lys 


Thr 


Tyr Pro 


Gin 


Leu 


Met 


Phe 


Leu 


Thr 


65 










70 








75 










80 


Asp 


Val 


Asp 


Asp 


Leu 


Met 


Asp 


Phe Ser 


Ser 


Thr 


Trp 


Asp 


He 


Arg 


Ala 




85 








90 










95 




Thr 


Pro 


Thr 


Phe 


Phe 


Phe 


Leu 


Lys Asn Gly Gin 


Gin 


lie 


Asp 


Lys 


Leu 








100 








105 










110 






Val 


Gly 


Ala 
115 


As n 


Lys 


Pro 


Glu 



















<210> 176 
<211> 106 
<212> PRT 

<213> Manduca sexta 

<400> 176 
Me 
1 



65 



Ser He 


His 


He 


Lys 


Asp 


Ala 


Asp 


Asp 


Leu 


Lys 


Asn 


Arg 


Leu 


Ala 






5 






10 










15 




Ala Gly 


Asp 


Lys 


Leu 


Val 


Val 


He 


Asp 


Phe 


Met 


Ala 


Thr 


Trp 


Cys 


20 








25 










30 






Pro Cys 


Lys 


Met 


He 


Gly 


Pro 


Lys 


Leu 


Asp 


Glu 


Met 


Ala 


Ala 


Glu 


35 








40 










45 






Glu 


Ala Asp 


Ser 


He 


Val 


Val 


Val 


Lys 


Val 


Asp 


Val 


Asp 


Glu 


Cys 


50 








55 










60 








Phe 


He Ala 


Ala 


Asp 


Tyr 


Asn 


He 


Asn 


Ser 


Met 


Pro 


Thr 


Phe 


Val 






70 










75 










80 


Lys Asn 


Ser 


Lys 


Lys 


Leu 


Glu 


Glu 


Phe 


Ser 


Gly Ala 


Asn 


Val 


Asp 




85 










90 










95 




Leu Lys 


Asn 


Thr 


He 


Leu 


Lys 


Leu 


Lys 














100 










105 

















<210> 177 

<211> 221 

<212> PRT 

<213> Bradyrhizobium japonicum 



Met 


Leu 


Asp 


Thr 


Lys 


Pro 


Ser 


Ala 


Thr 


Arg 


Arg 


He 


Pro 


Leu 


Val 


He 


1 






5 










10 










15 


Gly 


Ala 


Thr 


Val 


Ala 


Val 


Gly 


Gly 


Leu 


Ala 


Gly 


Phe 


Ala 


Ala 


Leu 


Tyr 








20 








25 










30 




Ala 


Leu 


Gly 


Leu 


Ser Arg 


Ala 


Pro 


Thr 


Gly 


Asp 


Pro 


Ala 


Cys 


Arg Ala 




35 










40 










45 








Val 


Ala 


Thr 


Ala 


Gin 


Lys 


He 


Ala 


Pro 


Leu 


Ala 


His 


Gly 


Glu 


Val 


Ala 




50 








55 










60 








Phe 


Ala 


Leu 


Thr 


Met 


Ala 


Ser 


Ala 


Pro 


Leu 


Lys 


Leu 


Pro 


Asp 


Leu 


Ala 


65 










70 










75 










80 


Glu 


Asp 


Ala 


Asp 


Gly Lys 


Pro 


Lys 


Lys 


Leu 


Ser 


Asp 


Phe 


Arg Gly 


Lys 






85 










90 










95 




Thr 


Leu 


Leu 


Val 
100 


Asn 


Leu 


Trp 


Ala 


Thr 
105 


Trp 


Cys 


Val 


Pro 


Cys 
110 


Arg 


Lys 


Glu 


Met 


Pro 


Ala 


Leu 


Asp 


Glu 


Leu 


Gin 


Gly 


Lys 


Leu 


Ser 


Gly 


Pro 


Asn 






115 








120 










125 








Phe 


Glu 


Val 


Val 


Ala 


He 


Asn 


He 


Asp 


Thr 


Arg Asp 


Pro 


Glu 


Lys 


Pro 
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13 0 

Lys Thr Phe Leu 
145 

Asp Gin Lys Ala 

Leu Gly Met Pro 

180 

Ala Thr He Ala 
195 

Leu He Arg Ala 
210 



135 

Lys Glu Ala Asn 
150 

Lys Val Phe Gin 
165 

Thr Ser Val Leu 

Gly Pro Ala Glu 

200 

Ala Thr Gly Lys 
215 



140 

Leu Thr Arg Leu 
155 

Asp Leu Lys Ala 
170 

Val Asp Pro Gin 
185 

Trp Ala Ser Glu 

Ala Ala Ala Ala 

220 



Gly Tyr Phe Asn 

160 

He Gly Arg Ala 
175 

Gly Cys Glu He 
190 

Asp Ala Leu Lys 

205 

Leu 



<210> 178 
<211> 167 
<212> PRT 

<213> Haemophilus influenzae 



Met°Lys 7 He Lys Lys Leu Leu Lys Asn Gly Leu Ser Leu Phe Leu Thr 

1 5 10 i 

Phe He Val He Thr Ser He Leu Asp Phe Val Arg Arg Pro Val Val 

20 25 3 

Pro Glu Glu He Asn Lys He Thr Leu Gin Asp Leu Gin Gly Asn Thr 



35 40 45 

Phe Ser Leu Glu Ser Leu Asp Gin Asn Lys Pro Thr Leu Leu Tyr Phe 

50 55 60 

Trp Gly Thr Trp Cys Gly Tyr Cys Arg Tyr Thr Ser Pro Ala He Asn 
65 70 75 

r Leu Ala Lys Glu Gly Tyr 

85 

y Asn Glu Ala Asp Val Asn 
100 

,« t>V»t~ TVit- Val Ann Ard Pro 

120 125 



Ser Leu Ala Lys Glu Gly Tyr Gin Val Val Ser Val Ala Leu Arg Ser 

85 90 
Gly Asn Glu Ala Asp Val Asn Asp Tyr Leu Ser Lys Asn Asp Tyr His 

100 ~ 105 H° 

Phe Thr Thr Val Asn Asp Pro Lys Gly Glu Phe Ala Glu Arg Trp Gin 

115 I 20 125 

He Asn Val Thr Pro Thr He Val Leu Leu Ser Lys Gly Lys Met Asp 

130 I 35 140 

Leu Val Thr Thr Gly Leu Thr Ser Tyr Trp Gly Leu Lys Val Arg Leu 

145 150 155 l g0 

Phe Phe Ala Glu Phe Phe Gly 

165 



<210> 179 
<211> 163 
<212> PRT 

<213> Leishmania major 

Met°Lei 7 Lys Val Ser Ser Lys Glu Hi 3 Tyr Ala Glu He Lys Lys Lys 

1 5 10 

Ala Glu Asp Ser Leu Gly Leu Val Val His Phe Ser Ala Thr Trp Cys 

Glu Pro Cys Thr Ala Val Asn Glu His Leu Thr Lys Gin Ala Ala Glu 

35 40 
Tyr Gly Asp Asn Val Val Phe Ala Glu Val Asp Cys Gly Glu Leu Gly 

Asp Val Cys Glu Ala Glu Gly Val Glu Ser Val Pro Phe Val Ala Tyr 
£"c 70 75 

Phe Arg Thr Pro Leu Val Gly Asp Asp Arg Arg Val Glu Arg Val Ala 

85 90 9 ? 

Asp Val Ala Gly Ala Lys Phe Asp Gin He Asp Met Asn Thr His Ser 

100 1° 5 
Leu Phe Gly Glu Lys Gly Gly Asn Arg Gly Ser Ala Glu Gly Leu Cys 

115 120 125 

His Ser Gly Arg Leu Pro Ala Leu Pro His Glu Ala Ala Arg Gly Arg 
130 135 140 
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Asn Val His His Arg His Pro lie Ser Ser Ala Leu Arg Leu Tyr Trp 
145 150 155 160 

Ser Ala Val 



<210> 180 
<211> 275 
<212> PRT 

<213> Mortierella alpina 



<400> 180 
Met Val Ser Asn 
1 

Gin Val Phe Gin 

20 

Ala Ala Trp Ala 
35 

Leu Ala Ala Lys 
50 

Lys Phe Pro Asp 
65 

Phe Val lie Val 

Asn Ala Pro Glu 

100 

Ser Ser Pro Leu 
115 

Ala Ala Pro Ser 
13 0 

Ala Arg Leu Lys 
145 

Lys Gly Thr Pro 

Glu lie Leu Thr 

180 

Ala Asp Asp Glu 
195 

Thr Phe Pro Gin 
210 

Val Val Lys Glu 
225 

Ala Glu Lys Asp 

Pro Val Met He 

260 

Phe Ser Lys 
275 



Asn Tyr He Asp 
5 

Pro Ser Ser Ser 

Pro Pro Cys Val 

40 

Asn Ala Asn Val 
55 

He Ser Glu Asp 
70 

Lys Glu Gly Thr 
85 

Leu Ala Lys Val 

Pro Thr Gin Ser 

120 

Val Ala Pro Pro 
135 

Glu Leu Thr Ser 
150 

Thr Ala Pro Arg 
165 

Ala Gin Asn He 

Val Arg Gin Ala 

200 

Val Tyr Val Lys 
215 

Leu Val Ala Ser 
230 

Leu Lys Thr Arg 
245 

Phe He Lys Gly 



He Thr Ser Glu 
10 

Thr Val Tyr Ala 
25 

Gin Met Asn Glu 

Asn Phe Leu Lys 

60 

Tyr Glu He Ala 
75 

Val Val Asp Arg 
90 

He Ala Lys Tyr 
105 

Ser Thr Met Ala 

Thr Met Ser Pro 

140 

Ser Ser Ser Val 
155 

Cys Gin Phe Ser 
170 

Arg Phe Ser Ser 
185 

Met Lys Thr Phe 

Gly Glu Phe Val 

220 

Gly Glu Phe Gin 
235 

Met Asp Glu Leu 
250 

Ser Pro Glu Thr 
265 



Asp Asp Phe Ala 
15 

Leu Asn Phe Trp 
30 

Val Phe Glu Glu 
45 

He Glu Ala Glu 

Ala Val Pro Ser 

80 

Val Glu Gly Ala 
95 

Ser Lys Ser Thr 
110 

Ala Ala Gly His 
125 

Glu Glu Met Asn 

Met Ala Phe He 

160 

Arg Gin Leu Leu 
175 

Phe Asn He Leu 
190 

Ser Asp Trp Pro 
205 

Gly Gly Leu Asp 

Ala Leu Val Pro 

240 

He Arg Lys Ala 
255 

Pro Arg Cys Gly 
270 



<210> 181 
<211> 160 
<212> PRT 
<213> Neisseria 

<400> 181 
Met Lys Arg 'Leu 
1 

Ala His Thr Ala 

20 

Pro Gin Asn Leu 
35 

Trp Ala Thr Trp 
50 

Lys Trp Tyr Lys 
65 

Ala Leu Asp Thr 



gonorrhoeae 



He Leu Ala Ala 
5 

Ser Gly Asp Glu 

Gin Ser Leu Lys 

40 

Cys Gly Pro Cys 
55 

Ala Gin Lys Lys 
70 

Ser Asp Asn He 



He Ala Leu Ala 
10 

Leu Ala Gly Trp 
25 

Ala Pro Val Arg 

Arg Lys Glu Met 

60 

Gly Ser Val Asp 
75 

Gly Asn Phe Leu 



Ala Thr Phe Gly 
15 

Lys Asp Asn Thr 
30 

He Ala Asn Leu 
45 

Pro Ala Met Ser 

Met Val Gly He 

80 

Lys Gin Thr Pro 
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85 



Val 


Ser 


Tyr 


Pro 


lie 


Trp 


Arg 


Tyr 








100 










Met 


Lys 


Ser 


Tyr 


Gly 


Asn 


Asn 


Val 




115 










120 


Glu 


Ala 


Pro 


Lys 


Cys 


Gly 


Tyr 


Arg 




130 










135 




Glu 


Lys 


Ser 


Leu 


Thr 


Glu 


Ala 


Val 


145 








150 







90 95 
Thr Gly Ala Asn Ser Arg Ser Phe 
105 110 
Gly Val Leu Pro Phe Thr Val Val 

125 

Gin Thr lie Thr Gly Glu Leu Asn 

140 

Lys Leu Ala His Ser Lys Cys Arg 
155 160 



<210> 182 




<211> 208 




<212 


!> PRT 




<213> Rhizobium 


<400> 182 




Met 


Ala 


Gly 


Ala 


1 








Arg 


Ser Gly 


Asn 








20 


Asp 


Asp 


Val 


Ala 






35 




Ala 


Ala 


Ala 


Thr 




50 






Gin 


Ser 


Met 


Lys 


65 








Thr 


He 


Ala 


Asp 


Thr 


Trp 


Cys 


Ala 








100 


Gin 


Lys 


Asp 


Lys 






115 




Asp 


Ala Gly 


Asp 




130 






Val 


Glu 


Ala 


Leu 


145 








Asp 


Leu 


Lys 


Ala 


lie 


Asp 


Ser 


Glu 








180 


Trp 


Ser Gly 


Arg 






195 





loti 



Leu Ala Gly Ala 
5 

Asn Ala Pro Ala 

Cys Ala Ala Lys 

40 

Gly Glu Val Ala 
55 

Ser Leu Ala Phe 
70 

His Ala Gly Lys 
85 

Pro Cys Arg Ala 

Gly Ser Asp Ala 

120 

Asp Val Lys Pro 
135 

Gly Tyr Phe Arg 
150 

Arg Gly Leu Ala 
165 

Gly Cys Leu He 

Asp Ala Arg Arg 

200 



Val Ala Val Tyr 
10 

Arg Val Ala Val 
25 

Ser Gly Arg Ala 

Ala Leu Leu Pro 

60 

Asn Gly Pro Asp 
75 

Thr Val Leu Leu 
90 

Glu Met Pro Ala 
105 

Phe Gin Val He 

Lys Lys Phe Leu 

140 

Asp Ser Thr Val 
155 

Leu Gly Leu Pro 
170 

Ala His Met Asn 
185 

Leu Val Glu Thr 



Val Ser Glu Ser 
15 

Gly Gly Ser LyB 
30 

Lys Lys lie Ala 
45 

Ala Asp Pro Pro 

Gly Lys Pro Met 

80 

Asn Leu Trp Ala 
95 

Leu Asn Ala Leu 
110 

Ala Val Asn Val 
125 

Lys Glu Thr Gly 

Ala Leu Phe Asn 

160 

Val Thr Met Leu 
175 

Gly Pro Ala Glu 
190 

Ala Leu Gly Ser 
205 



<210> 183 

<211> 176 

<212> PRT 

<213> Rhodobacter capsulatus 



<400> 183 






















Ala 


Gly 


Phe 


Met 


Ala 


Lys 


Pro 


Leu 


Met 


Phe 


Leu 


Pro 


Leu 


Leu 


Val 


Met 


1 






5 










10 










15 




Val 


Gly 


Ala 


Gly 


Tyr 


Phe 


Ala 


Met 


Gin 


Gin 


Asn 


Asp 


Pro 


Asn 


Ala 


Met 






20 








25 










30 


Glu 




Pro 


Thr 


Ala 


Leu 


Ala 


Gly 


Lys 


Glu 


Ala 


Pro 


Ala 


Val 


Arg 


Leu 


Pro 






35 








40 










45 




Gly 




Leu 


Gly 


Ala 


Glu 


Ala 


Pro 


Phe 


Thr Asp Ala Asp 


Leu 


Arg 


Asp 


Lys 




50 










55 










60 








Val 


He 


Lys 


Leu 


Val 


Asn 


Phe 


Trp 


Ala 


Ser 


Trp 


Cys 


Ala 


Pro 


Cys 


Arg 


65 








70 








75 








He 


80 


Glu 


His 


Pro 


Asn 


Leu 


He 


Gly 


Leu 


Lys 


Gin 


Asp 


Gly 


He 


Glu 


Met 










85 








90 










95 


Ala 


Gly 


Val 


Asn 


Trp 


Lys 


Asp 


Thr 


Pro 


Asp 


Gin 


Ala 


Gin 


Gly 


Phe 


Leu 






100 










105 










110 






Glu 


Met 


Gly 
115 


Ser 


Pro 


Tyr 


Thr 


Arg 
120 


Leu 


Gly 


Ala 


Asp 


Pro 
125 


Gly 


Asn 


Lys 
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Met Gly Leu Asp Trp Gly Val Ala 

130 135 
Asp Gly Ala Gly Arg He Leu Thr 
145 150 
Asp Val lie Thr Lys Lys lie Asp 

165 



Gly Val Pro Glu Thr Phe Val Val 

14 0 

Arg He Ala Gly Pro Leu Thr Glu 
155 160 
Pro Leu Leu Ala Gly Thr Ala Asp 
170 175 



<210> 184 
<211> 105 
<212> PRT 

<213> Synechocystis 



<400> 184 



Met 


Ala 


Val 


Lys 


Lys 


1 








5 


Ser 


Pro 


Lys 


Pro 


Val 








20 




Cys 


Gin 


Met 


Met 


Ala 




35 






Gin 


Gin 


He 


Gin 


Val 




50 








Ala 


Thr 


Gin 


Tyr 


Gin 


65 










Gin Gly 


Gin 


Pro 


Val 










85 


Leu 


He 


Gin 


Gin 


Leu 








100 





Gin Phe Ala Asn Phe Ala 

10 

Leu Val Asp Phe Tyr Ala 

25 

Pro He Leu Glu Gin Val 
40 

Val Lys He Asp Thr Asp 
55 

He Gin Ser Leu Pro Thr 
70 75 
His Arg Met Glu Gly Val 

90 

Gin Val Phe Val 

105 



Glu Met Leu Ala Gly 

15 

Thr Trp Cys Gly Pro 
30 

Gly Ser His Leu Arg 
45 

Lys Tyr Pro Ala He 
60 

Leu Val Leu Phe Lys 

80 

Gin Gin Ala Ala Gin 

95 



<210> 185 
<211> 109 
<212> PRT 

<213> Synechocystis 



<400> 185 




Met 


Ser 


Leu 


Leu 


1 








Gly 


Gin 


Thr 


Lys 








20 


Pro 


Cys 


Arg 


Leu 






35 




Gly 


Asp 


Lys 


Leu 




50 






-Ala 


Val 


Ala 


Gin 


65 








Lys 


Asn 


Asn 


Glu 


Lys 


Leu 


Leu 


Glu 






100 



Glu He Thr Asp 
5 

Pro Val Leu Val 

Met Ala Pro Ala 

40 

Lys Val Leu Lys 
55 

Cys Lys Val Glu 
70 

Leu Val Met Thr 
85 

Leu Leu Lys Glu 



Ala Glu Phe Glu 
10 

Tyr Phe Trp Ala 
25 

He Gin Ala He 

Leu Glu Val Asp 

60 

Gly Val Pro Ala 
75 

His Glu Gly Ala 
90 

Glu Leu Asp Phe 
105 



Gin Glu Thr Gin 
15 

Ser Trp Cys Gly 
30 

Ala Lys Asp Tyr 
45 

Pro Asn Pro Ala 

Leu Arg Leu Phe 

80 

He Ala Lys Pro 
95 

He 



<210> 186 
<211> 290 
<212> PRT 

<213> S chi zosac char omyce s pombe 
<400> 186 

Met Ser Val He Glu He Arg Ser Tyr Gin His Trp He Ser Thr He 

15 10 15 

Pro Lys Ser Gly Tyr Leu Ala Val Asp Cys Tyr Ala Asp Trp Cys Gly 

20 25 30 

Pro Cys Lys Ala He Ser Pro Leu Phe Ser Gin Leu Ala Ser Lys Tyr 

35 40 45 

Ala Ser Pro Lys Phe Val Phe Ala Lys Val Asn Val Asp Glu Gin Arg 

50 55 60 

Gin He Ala Ser Gly Leu Gly Val Lys Ala Met Pro Thr Phe- Val Phe 
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Phe 


Glu 






Ala 


Leu 


T.i /"a 
i-iy a 










100 

»1» \J w 


Gly Ala 


jlj c; n 


Ala 

Ji- -L- 






■J- J. — r 




Leu 


Vain 


Glv 


Cvs 




J J u 










His 


Asp 


u. ■* J 








.Leu. 


bill 


OCJL 




Glu 


Val 


VdJ. 


J_t y a 








i pn 


Thr 


Ser 


Car 

OCX. 


7\ 1 a 
V-VJL cl 






195 




Asn 


Leu 


Ser 


Phe 




210 






Glu 


Asp 


lie 


Val 


225 








Arg 


Phe 


Val 


Lys 


Ser 


Asn 


Val 


Gly 








260 


Phe 


Gly 


Glu 


Pro 






275 




Glu 


Ala 








290 







70 

Lys Gin He Asp 
85 

Lys Val Ala Leu 

Ser Ser Ser Ser 

120 

He Glu Asn Pro 
135 

Leu Lys Ser Ala 
150 

Val Asp Glu Gin 
165 

Val His Ser He 

Pro Lys Thr He 

200 

Glu Asp Ala Glu 
215 

Tyr Glu Gin Asp 
23 0 

Phe Gin Arg Val 
245 

Glu Glu Glu Thr 

Val Gly Asp Ser 

280 



75 

Met Leu Thr Gly 
90 

He Ser Ser Lys 
105 

Ala Pro Val Lys 

Gin Leu Glu Cys 

140 

Phe Asn Ser Asn 
155 

Leu Met He Tyr 
170 

Ala He Thr Pro 
185 

Lys Leu Tyr He 

Ser Phe Thr Pro 

220 

Asp Gin Pro Thr 
235 

Asn Ser Leu Val 
250 

Thr Lys He Ser 
265 

Ser Lys Gly Lys 



80 

Ala Asn Pro Gin 
95 

Ala Thr Gly Thr 
110 

Gly Phe Ala Ser 
125 

Leu Asn Gin Gin 

Pro Ser Ser Phe 

160 

He Pro Phe Leu 
175 

Val Lys Gly Glu 
190 

Asn Gin Pro Asn 
205 

Thr Gin Val He 

He He Pro Leu 

240 

He Phe He Tyr 
255 

Arg Leu Glu Leu 
270 

Leu Gin Lys Val 
285 



<210> 187 
<211> 185 
<212> PRT 

<213> Treponema pallidum 



<400> 187 
Met Phe Arg Ser 
1 

Gin Ala Ala Asp 

20 

Lys Ala Leu Glu 
35 

Leu Ser Ser Asp 
50 

Pro Ser Leu Phe 
65 

Trp Cys Pro Pro 

Ala Leu Met Arg 

100 

Asp Ser Arg Lys 
115 

Phe Pro He Tyr 
130 

Ser Arg Gly Leu 
145 

Val Ala Val Val 

Ala Leu Phe Lys 

180 



Asp Leu Val Leu 
5 

Val Ala His Asn 

Arg Phe Arg Phe 

40 

Phe His Ala Lys 
55 

Lys Gly Lys Val 
70 

Cys Arg Ala Glu 
85 

Gly Asn Asp Phe 

Gin Val Glu Ser 

120 

Leu Asp Glu Glu 
135 

Pro Thr Thr Tyr 
150 

Val Gly Ser Val 
165 

Glu Leu Ala Arg 



Ala Val Trp Gly 
10 

Ala Asp Val Pro 
25 

Phe Val Tyr Pro 

Ala Leu Lys Gly 

60 

Thr Leu Leu Asn 
75 

Met Pro Ser Met 
90 

Gin He Val Ala 
105 

Phe He Ala Arg 

Gly Ser Leu Gly 

140 

Val Val Asp Lys 
155 

Glu Tyr Asp Gin 
170 

Asp 
185 



Val Thr Cys Val 
15 

Ser Arg Ser Leu 
30 

Lys Pro Leu Asp 
45 

Glu Ala Leu Val 

Phe Trp Ala Thr 

80 

Asp Arg Met Gin 
95 

Val Asn Val Gly 
110 

Gly Lys His Thr 
125 

Ser Val Phe Ala 

Ala Gly Arg He 

160 

Pro Glu Leu Val 
175 



<210> 188 
<211> 246 
<212> PRT 

<213> Caenorhabditis' elegans 
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<400> 188 

Met Leu Leu Arg Leu 

1 5 
Ser Gly Gly Pro Thr 

20 

Asn Trp Thr Asp Leu 
35 

Pro Trp Cys Pro Ala 
50 

Ala Asp Trp Ser Asp 
65 

Thr Val Asn Pro Gly 

85 

Thr lie Tyr His Val 

100 

Arg Asp Lys Asn Asp 
115 

Val lie Asp Pro Val 
130 

Ala Val Val Ala Val 
145 

His Asn His Leu Val 

165 

Gly Leu Phe Ala Gly 

180 

Phe lie Val lie He 
195 

Ser Gin Gin Ala Lys 
210 

Gly Thr Glu Ser Pro 
225 

Glu Thr Lys Lys Thr 

245 



Leu Ala Val Leu Gly Leu 

10 

Arg Ser Ser Lys Leu Val 

25 

Met Lys Gly Glu Trp Met 
40 

Cys Lys Asp Leu Gin Lys 
55 

Asp Leu Gly He Lys Val 
70 75 
Leu Ser Gly Arg Phe Leu 

90 

Lys Asp Gly Val Phe Arg 

10 5 

Phe lie Ser Phe Val Glu 
12 0 

Pro Asp Tyr Lys His Pro 
135 

Phe Phe Lys Leu Ser Met 
150 155 
Glu Asp Lys Gly He Pro 

170 

Val Thr Leu Ala Leu Gly 

185 

He Asp Gin Val Phe Pro 
200 

Lys Thr Glu Lys Lys Asp 
215 

Thr Lys Lys Asn Gly Asn 
230 235 
Lys 



Phe Ala Val Gly Val 

15 

Phe Leu Asn Glu Glu 
30 

He Glu Phe His Ala 
45 

Ala Trp Asn Ala Phe 
60 

Gly Glu Val Asp Val 

80 

Val Thr Ala Leu Pro 

95 

Gin Tyr Ser Gly Ala 
110 

Asp Lys Lys Tyr Arg 

125, 

Asn Ser Lys Gin Met 
140 

Ser Val Arg Asp Leu 

160 

Ser Trp Ala Ser Tyr 

175 

Cys Val Leu Gly Phe 
190 

Thr Gly Pro Arg Lys 
205 

Ala Lys Lys Asp Ser 
220 

Asn Asn Asn Gly Lys 

240 



<210> 189 
<211> 284 
<212> PRT 

<213> Caenorhabditis elegans 



<400> 189 
























Gin 


Leu 


Met 


Pro 


Val 


lie 


Asn 


Val 


Lys 


Asp 


Asp 


Glu 


Asp 


Phe 


Arg 


Asn 


1 








5 






10 










15 




Ser 


Leu 


Ala 


Gly 
20 


Leu 


Lys 


Ser 


Val 


He 

25 


Val 


Asp 


Phe 


Thr 


Ala 
30 


Val 


Trp 


Cys 


Gly 


Pro 


Cys 


Lys 


Met 


He 


Ala 


Pro 


Thr 


Phe 


Glu 


Ala 


Leu 


Ser 


Asn 


35 










40 










45 






Glu 


Gin 


Tyr 


Leu 


Gly 


Ala 


Val 


Phe 


Leu 


Lys 


Val 


Asp 


Val 


Glu 


He 


Cys 




50 








55 










60 








Val 


Lys 


Thr 


Ser 


Ser 


Glu 


Asn 


Gly Val Asn 


Ser 


Met 


Pro 


Thr 


Phe 


Met 


65 










70 










75 








Ala 


80 


Phe 


Gin 


Ser 


Gly 


Val 


Arg 


Val 


Glu 


Gin 


Met 


Lys 


Gly Ala 


Asp 


Lys 








85 










90 








Ala 


95 


Asp 


Ala 


Leu 


Glu 


Thr 


Met 


Val 


Lys 


Lys 


Tyr 


Ala 


Asp 


Asn 


Ser 


Ala 








100 








105 










110 






Ser 


Leu 


Val 


Ala 


Gly 


Gin 


Met 


Asp 


Leu 


Thr 


Pro 


Leu 


Val 


Asp 


Lys 


Lys 






115 








120 










125 






Phe 


Gin 


Met 


Glu 


Cys 


Leu 


Asn 


Glu 


Ser 


Asp 


Asp 


Thr 


Pro 


Leu Gly Arg 




130 








135 










140 








He 


Leu 


Glu Gly 


Asn 


Cys 


Asn 


Leu 


Val 


Ser 


Asp 


Cys 


Asp 


Glu 


Gin 


Leu 


145 








150 










155 










160 


He 


Ser 


Leu 


Pro 


Phe 


Asn 


Gin 


Pro 


Val 


Lys 


Val 


His 


Ser 


He 


Leu 


He 








165 










170 










175 




Lys 


Gly Val 


Ser 


Asp 


Arg 


Ala 


Pro 


Lys 


Lys 


Val 


Lys 


Val 


Phe 


He 


Asn 






180 










185 










190 




Thr 


Leu 


Pro 


Lys 
195 


Thr 


Thr 


Asp 


Phe 


Asp 
200 


Asn 


Ala 


Thr 


Ala 


Leu 
205 


Glu 


Pro 
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Gin Met Leu Glu 
210 

Val Ala Leu Lys 
225 

Phe He Glu Asn 

Leu Thr Val Phe 

260 

Lys Arg Val Ala 
275 



Phe Asp Glu Ser 
215 

Tyr Val Lys Phe 
23 0 

Asn Val Gly Gly 
245 

Gly Thr Pro Leu 

Gly Lys Ala Gly 

280 



Ser lie Gin Gly 

220 

Gin Asn Val Gin 
235 

Gly Asp Val Thr 
250 

Ser Ala Leu Asn 
265 

Asp Ala Ala His 



His Gly Gin Val 

Asn He Gin Phe 

240 

Glu Leu Val Lys 
255 

Met Asn Glu Phe 
270 



<210> 190 
<211> 287 
<212> PRT 

<213> Drosophila melanogaster 

Met°Ser 9 Val Arg Val He Asn Asp Glu Ser His Phe Gin Ala Glu Leu 

1 5 10 15 

Ala Gin Ala Gly He Gin Leu Val Val Val Asp Phe Thr Ala Ser Trp 

20 25 30 

Cys Gly Pro Cys Lys Arg He Ala Pro He Phe Glu Thr Phe Pro Thr 

35 * ~ 40 45 

Lys Tyr Pro Lys Ala He Phe Leu Lys Val Asp Val Asp Lys Cys Gin 

50 55 60 

Asp Thr Ala Ala Gly Gin Gly Val Ser Ala Met Pro Thr Phe He Phe 
65 70 75 80 

Tyr Arg Asn Arg Thr Lys He Asp Arg Val Gin Gly Ala Asp Val Asn 

85 90 95 

Gly Leu Glu Ala Lys He Gin Glu His He Gly Thr Ser Gly Gly Glu 

100 105 HO 

Glu Gly Gly Glu Asp Tyr Gly Gin Gly Leu Met Glu Leu Asn Thr Phe 

115 " 120 125 

He Ser Lys Gin Glu Cys Glu Cys Leu Asn Glu Ala Asp Asp His Asn 

130 135 140 

Leu Lys His Ala Leu Ala Ser Ala Gly Gly Tyr Leu Gin Ser Asp Cys 
145 150 155 160 

Asp Glu Gin Leu He Leu Ser He Thr Phe Asn Gin Ala Val Lys lie 

165 170 175 

His Ser Leu Lys Phe Lys Ala Pro Ser His Leu Gly Pro Lys Asp Val 

180 185 190 

Lys Leu Phe He Asn Gin Pro Arg Thr He Asp Phe Asp Met Ala Glu 

195 200 205 

Ser Met Asn Ser Val Gin Asp Leu Ser Leu Ala Gin Lys Glu Leu Glu 

210 215 220 

Ser Gly Val Pro Val Asn Leu Arg Tyr Val . Lys Phe Gin Asn Val Gin 
225 230 235 240 

Asn He Gin He Phe Val Lys Asn Asn Gin Ser Gly Gly Asp Val Thr 

245 250 255 

Gin He Asp Tyr He Gly Phe He Gly Ser Pro He Met Thr Thr Lya 

260 " 265 270 ^ 

Met Asn Asp Phe Lys Arg Val Ala Gly Lys Lys Gly Glu Ser His 
275 280 , 285 



<210> 191 
<211> 289 
<212> PRT 
<213> Homo sapien 



Met Val Gly Val Lys Pro Val Gly Ser Asp Pro Asp Phe Gin Pro Glu 

1 5 10 15 

Leu Ser Gly Ala Gly Ser Arg Leu Ala Val Val Lys Phe Thr Met Arg 

20 ~ 25 30 

Gly Cys Gly Pro Cys Leu Arg He Ala Pro Ala Phe Ser Ser Met Ser 



-105- 



WO 02/50289 



PCT7US01/50240 



35 

As n Lys Tyr Pro 
50 

Gin Gly Thr Ala 
65 

Phe Phe Arg Asn 

Val Gly Leu Glu 

100 

Ser Asn Glu Asp 
115 

Phe lie Asn Lys 
130 

Gly Phe Asp Asn 
145 

Cys Asp Glu Gin 

Leu Tyr Ser Met 

1B0 

Tyr Val Lys He 
195 

Ala Glu Arg Ser 
210 

He Lys Glu Asp 
225 

Val Asn Ser Val 

Thr Thr Arg He 

260 

Thr Asn Met Asn 
275 

His 



40 

Gin Ala Val Phe 
55 

Ala Thr Asn Asn 
70 

Lys Val Arg He 
85 

Glu Lys He Lys 

Thr Asp He Pro 

120 

Ala Gly Cys Glu 
135 

Cys Leu Arg Lys 
150 

Leu Leu He Thr 
165 

Lys Phe Gin Gly 

Phe He Asn Leu 

200 

Glu Pro Thr Gin 
215 

Gly He Val Pro 
230 

Thr He Phe Val 
245 

Ser Tyr Phe Thr 

Asp Phe Lys Arg 

280 



Leu Glu Val Asp 

60 

He Ser Ala Thr 
75 

Asp Gin Tyr Gin 
90 

Gin His Leu Glu 
105 

Lys Gly Tyr Met 

Cys Leu Asn Glu 

140 

Asp Thr Thr Phe 
155 

Val Ala Phe Asn 
170 

Pro Asp Asn Gly 
185 

Pro Arg Ser Met 

Ala Leu Glu Leu 

220 

Leu Arg Tyr Val 
235 

Gin Ser Asn Gin 
250 

Phe lie Gly Thr 
265 

Val Val Gly Lys 



45 

Val His Gin Cys 

Pro Thr Phe Leu 

80 

Gly Ala Asp Ala 
95 

Asn Asp Pro Gly 
110 

Asp Leu Met Pro 
125 

Ser Asp Glu His 

Leu Glu Ser Asp 

160 

Gin Pro Val Lys 
175 

Gin Gly Pro Lys 
190 

Asp Phe Glu Glu 
205 

Thr Glu Asp Asp 

Lys Phe Gin Asn 

240 

Gly Glu Glu Glu 
255 

Pro Val Gin Ala 
270 

Lys Gly Glu Ser 
2 85 



<210> 192 
<211> 335 
<212> PRT 
<213> Homo sapien 



<400> 192 




Met 


Glu 


Ala 


Gly 


1 








Gly 


Ser 


Ala 


Gly 






20 


Leu 


Leu 


Val 


Val 






35 




Met 


Asn 


Glu 


Val 




50 






Phe 


Val 


Lys 


Leu 


65 








Glu 


He 


Ser 


Ser 


He 


Asp 


Arg 


Leu 








100 


Gin 


Arg 


His 


Ala 






115 




Leu 


Lys 


Glu 


Asp 




130 






Pro 


Cys 


Met 


Leu 


145 








Phe 


Ser 


Lys 


Gin 


Ser 


Ser 


Phe 


Asp 








180 


Ala 


Tyr 


Ser 


Ser 






19'5 





Ala Ala Glu Ala 
5 

Gin Phe Glu Glu 

His Phe Trp Ala 

40 

Met Ala Glu Leu 
55 

Glu Ala Glu Gly 
70 

Val Pro Thr Phe 
85 

Asp Gly Ala His 

Ser Ser Gly Ser 

120 

Leu Asn Leu Arg 
135 

Phe Met Lys Gly 
150 

Met Val Glu He 
165 

lie Phe Ser Asp 

Trp Pro Thr Tyr 

200 



Ala Val Ala Ala 
10 

Leu Leu Arg Leu 
25 

Pro Trp Ala Pro 

Ala Lys Glu Leu 

60 

Val Pro Glu Val 
75 

Leu Phe Phe Lys 
90 

Ala Pro Glu Leu 
105 

Phe Leu Pro Ser 

Leu Lys Lys Leu 

14 0 

Thr Pro Gin Glu 
155 

Leu His Lys His 
170 

Glu Glu Val Arg 
185 

Pro Gin Leu Tyr 



Val Glu Glu Val 
15 

Lys Ala Lys Ser 
3 0 

Gin Cys Ala Gin 
45 

Pro Gin Val Ser 

Ser Glu Lys Tyr 

B0 

Asn Ser Gin Lys 
95 

Thr Lys Lys Val 
110 

Ala Asn Glu His 
125 

Thr His Ala Ala 

Pro Arg Cys Gly 

160 

Asn He Gin Phe 
175 

Gin Gly Leu Lys 
190 

Val Ser Gly Glu 
2 05 
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Leu 


He 


Gly 


Gly 




210 






Leu 


Asp 


Thr 


He 


225 








Val 


Leu 


Thr 


Asn 


Gin 


Glu 


Ala 


Lye 








260 


Ser 


Thr 


Gly 


Val 






275 




Val 


Arg 


Gin Gly 




290 






Leu 


Tyr 


Val 


Lys 


305 








Leu 


Lys 


Glu 


Asn 



Leu Asp lie He 
215 

Cys Pro Lys Ala 
230 

Lys Ala Ser Val 
245 

Cys Gly Phe Ser 

Glu Tyr Glu Thr 

280 

Leu Lys Ala Tyr 
295 

Gly Glu Leu Val 
310 

Gly Glu Leu Leu 
325 



Lys Glu Leu Glu 

220 

Pro Lys Leu Glu 
235 

Met Leu Phe Met 
250 

Lys Gin He Leu 
265 

Phe Asp He Leu 

Ser Asn Trp Pro 

300 

Gly Gly Leu Asp 
315 

Pro He Leu Arg 
330 



Ala Ser Glu Glu 

Glu Arg Leu Lys 

240 

Lys Gly Asn Lys 
255 

Glu He Leu Asn 
270 

Glu Asp Glu Glu 
285 

Thr Tyr Pro Gin 

He Val Lys Glu 

320 

Gly Glu Asn 
335 



<210> 193 
<211> 131 
<212> PRT 

<213> Phalaris coerulescens 



<400> 193 




Met 


Gly 


Gly 


Cys 


1 








Asp 


Phe 


Lys 


Gly 








20 


Asp 


Gin 


Lys 


He 






35 




Asn 


Phe 


Ser 


Ala 




50 






Tyr 


Ala 


Glu 


Met 


65 








Asp 


Val 


Asp 


Asp 


Thr 


Pro 


Thr 


Phe 








100 


Val 


Gly 


Ala 


Asn 






115 




Asp 


Gly 


Ser 






130 







Val 
5 


Gly 


Lys 


Asp 


Gly 


Asn 


Val 


His 


Ala 


Glu 


Ala 


Asn 








40 


Ser 


Trp 


Cys 


Gly 






55 




Ser 


Lys 


Thr 


Tyr 




70 






Leu 


Val 


Asp 


Phe 


85 








Phe 


Phe 


Leu 


Lys 


Lys 


Pro 


Glu 


Leu 






120 



Arg Gly He Val 
10 

Val He Thr Thr 
25 

Lys Asp Gly Lys 

Pro Cys Arg Val 

60 

Pro Gin Leu Met 
75 

Ser Ser Thr Trp 
90 

Asn Gly Gin Gin 
105 

Glu Lys Lys Val 



Glu Asp Lys Leu 
15 

Lys Glu Asp Trp 
3 0 

He Val Val Ala 
45 

He Ala Pro Val 

Phe Leu Thr He 

80 

Asp He Arg Ala 
95 

lie Asp Lys Leu 
110 

Gin Ala Leu Gly 
125 



<210> 194 
<211> 144 
<212> PRT 

<213> Trypanosoma brucei brucei 



<400> 194 
Met Ser Gly Leu 
1 

Lys Ser Gly Glu 

20 

Leu Tyr Phe Ser 
35 

Val Leu Ala Glu 
50 

Val Val Leu He 
65 

Tyr Gly Lys Met 

Val Ser Glu Leu 

100 

He Thr He Asn 
115 

Thr Arg Val He 



Ala Lys Tyr Leu 
5 

Val Ser Leu Gly 

Ala Ser Trp Cys 

40 

Phe Tyr Glu Lys 
55 

Ser Trp Asp Glu 
70 

Pro Trp Leu Ala 
85 

Gly Lys Thr Phe 

Ala Asp Thr Gly 

120 

Glu Asp Pro Asp 



Pro Gly Ala Thr 
10 

Ser Leu Val Gly 
25 

Pro Pro Cys Arg 

His His Val Ala 

60 

Asn Glu Ser Asp 
75 

Leu Pro Phe Asp 
90 

Gly Val Glu Ser 
105 

Ala He He Gly 
Gly Ala Asn Phe 



Asn Leu Leu Ser 
15 

Lys Thr Val Phe 
3 0 

Gly Phe Thr Pro 
45 

Lys Asn Phe Glu 

Phe His Asp Tyr 

80 

Gin Arg Ser Thr 
95 

He Pro Thr Leu 
110 

Thr Gin Ala Arg 
125 

Pro Trp Pro Asn 
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130 135 140 

<210> 195 
<211> 333 
<212> PRT 

<213> Arabidopsis thaliana 

<400> 195 . 

Met Asn Gly Leu Glu Thr His Asn Thr Arg Leu Cys lie Val Gly Ser 

! 5 10 15 

Gly Pro Ala Ala His Thr Ala Ala He Tyr Ala Ala Arg Ala Glu Leu 

20 25 30 

Lys Pro Leu Leu Phe Glu Gly Trp Met Ala Asn Asp He Ala Pro Gly 

35 40 45 

Gly Gin Leu Thr Thr Thr Thr Asp Val Glu Asn Phe Pro Gly Phe Pro 

50 55 60 

Glu Gly He Leu Gly Val Glu Leu Thr Asp Lys Phe Arg Lys Gin Ser 
65 70 75 80 

Glu Arg Phe Gly Thr Thr He Phe Thr Glu Thr Val Thr Lys Val Asp 

85 90 95 

Phe Ser Ser Lys Pro Phe Lys Leu Phe Thr Asp Ser Lys Ala He Leu 

100 105 HO 

Ala Asp Ala Val He Leu Ala Thr Gly Ala Val Ala Lys Arg Leu Ser 

115 120 125 

Phe Val Gly Ser Gly Glu Ala Ser Gly Gly Phe Trp Asn Arg Gly He 

130 135 140 

Ser Ala Cys Ala Val Cys Asp Gly Ala Ala Pro He Phe Arg Asn Lys 
145 ' 150 155 160 

Pro Leu Ala Val He Gly Gly Gly Asp Ser Ala Met Glu Glu Ala Asn 

165 ' 170 175 

Phe Leu Thr Lys Tyr Gly Ser Lys Val Tyr He He His Arg Arg Asp 

180 " 185 190 

Ala Phe Arg Ala Ser Lys He Met Gin Gin Arg Ala Leu Ser Asn Pro 

195 200 205 

Lys He Asp Val He Trp Asn Ser Ser Val Val Glu Ala Tyr Gly Asp 

210 " 215 220 

Gly Glu Arq Asp Val Leu Gly Gly Leu Lys Val Lys Asn Val Val Thr 
225 230 235 240 

Gly Asp Val Ser Asp Leu Lys Val Ser Gly Leu Phe Phe Ala He Gly 

245 250 255 

His Glu Pro Ala Thr Lys Phe Leu Asp Gly Gly Val Glu Leu Asp Ser 

260 265 270 

Asp Gly Tyr Val Val Thr Lys Pro Gly Thr Thr Gin Thr Ser Val Pro 

275 280 285 

Gly Val Phe Ala Ala Gly Asp Val Gin Asp Lys Lys Tyr Arg Gin Ala 

290 295 300 

He Thr Ala Ala Gly Thr Gly Cys Met Ala Ala Leu Asp Ala Glu His 
305 310 315 320 

Tyr Leu Gin Glu He Gly Ser Gin Gin Gly Lys Ser Asp 

325 330 



<210> 196 
<211> 383 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 196 

Met Cys Trp He Ser Met Ser Gin Ser Arg Phe He He Lys Ser Leu 

1 " ~ 5 10 15 

Phe Ser Thr Ala Gly Gly Phe Leu Leu Gly Ser Ala Leu Ser Asn Pro 

20 25 30 

Pro Ser Leu Ala Thr Ala Phe Ser Ser Ser Ser Ser Ser Ser Ser Ala 

35 40 45 

Ala Ala Ala Val Asp Met Glu Thr His Lys Thr Lys Val Cys He Val 
50 55 60 
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Gly Ser Gly Pro 
65 

Glu Leu Lys Pro 

Pro Gly Gly Gin 

100 

Phe Pro Glu Gly 
115 

Gin Ser Glu Arg 
130 

Val Asp Phe Ser 
145 

Val Leu Ala Asp 

Leu Ser Phe Thr 

18 0 

Gly He Ser Ala 
195 

Asn Lys Pro Leu 
210 

Ala Asn Phe Leu 
225 

Arg Asp Thr Phe 

Asn Pro Lys He 

260 

Gly Asp Glu Asn 
275 

Val Thr Gly Asp 
290 

He Gly His Glu 
305 

Asp Glu Asp Gly 

Val Val Gly Val 

340 

Gin Ala He Thr 
355 

Glu His Tyr Leu 
370 



Ala Ala His Thr 
70 

Leu Leu Phe Glu 
85 

Leu Thr Thr Thr 

He Leu Gly He 

120 

Phe Gly Thr Thr 
135 

Ser Lys Pro Phe 
150 

Ser Val He He 
165 

Gly Ser Gly Glu 

Cys Ala Val Cys 

200 

Val Val He Gly 
215 

Thr Lys Tyr Gly 
230 

Arg Ala Ser Lys 
245 

Glu Val He Trp 

Gly Arg Val Leu 

280 

Val Ser Asp Leu 
295 

Pro Ala Thr Lys 
310 

Tyr Val Val Thr 
325 

Phe Ala Ala Gly 

Ala Ala Gly Thr 

360 

Gin Glu He Gly 
375 



Ala Ala He Tyr 
75 

Gly Trp Met Ala 
90 

Thr Asp Val Glu 
105 

Asp He Val Glu 

He Phe Thr Glu 

14 0 

Lys Leu Phe Thr 
155 

Ser Thr Gly Ala 
170 

Gly Asn Gly Gly 
185 

Asp Gly Ala Ala 

Gly Gly Asp Ser 

220 

Ser Lys Val Tyr 
235 

He Met Gin Gin 
250 

Asn Ser Ala Val 
265 

Gly Gly Leu Lys 

Lys Val Ser Gly 

300 

Phe Leu Asp Gly 
315 

Lys Pro Gly Thr 
330 

Asp Val Gin Asp 
345 

Gly Cys Met Ala 

Ser Gin Glu Gly 

380 



Ala Ser Arg Ala 

80 

Asn Asp He Ala 
95 

Asn Phe Pro Gly 
110 

Lys Phe Arg Lys 
125 

Thr Val Asn Lys 

Asp Ser Arg Thr 

160 

Val Ala Lys Arg' 
175 

Phe Trp Abu Arg 
190 

Pro He Phe Arg 
205 

Ala Met Glu Glu 

He He His Arg 

240 

Arg Ala Leu Ser 
255 

Val Glu Ala Tyr 
270 

Val Lys Asn Val 
285 

Leu Phe Phe Ala 

Gin Leu Glu Leu 

320 

Thr Lys Thr Ser 
335 

Lys Lys Tyr Arg 
350 

Ala' Leu Asp Ala 
365 

Lys Ser Asp 



<210> 197 
<211> 323 
<212> PRT 

<213> Aquifex aeolicus 

<400> 197 ^ _., 

Met Ala Val Ser Leu Met Gin Gin Pro Asp Lys Val Tyr Asp Val lie 

15 10 15 

He He Gly Ala Gly Pro Ala Gly Thr Thr Ala Ala He Tyr Thr Ala 

20 25 30 

Arg Ala Gly Trp Lys Thr Leu Val Leu Tyr Arg Ala Glu Ala Asp Gly 

35 40 45 

Ala Leu Gly Val Thr Gin Lys He Glu Asn Tyr Pro Gly Val Pro Gly 

50 55 60 

Pro Leu Ser Gly Tyr Glu Leu Leu Lys He Met Arg Glu Gin Ala Lys 
65 7 0 75 8 0 

Ser Phe Gly Ala Glu Phe Val Arg Gly Lys Val He Ala Thr Asp Leu 

85 90 95 

Asn Ser Asp Pro Lys Lys Val Tyr Thr He Asp Gly Arg Glu Phe Arg 

100 105 HO 

Gly Lys Thr He He Val Ala Ser Gly Ala Met Glu Arg Ala Asn Lys 

115 ' 120 125 

Phe Lvs Gly Glu Glu Glu Phe Leu Gly Arg Gly Val Ser Tyr Cys Gly 

130 135 140 

Val Cys Asp Ala Ala Phe Phe Lys Asp Gin Pro Val Ala Val He Gly 
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145 

Asp Asp Asp Tyr 

Asn Lys Val Phe 

180 

Glu Val He Glu 
195 

Arg His Arg Pro 
210 

Lys Leu Lys Asp 
225 

Gly Val Phe He 

Met Gly Gin Val 

260 

Glu Met Met Thr 
275 

Cys Asn Glu Val 
290 

Ala Ala Leu Ala 
305 

Pro Gin Trp 



150 

Ala He Glu Glu 
165 

Phe Val Val Pro 

His Phe Glu Lys 

200 

He Glu He Val 
215 

Leu Glu Lys Lys 
230 

Phe Leu Gly Gly 
245 

Glu Met Thr Glu 

Ser Val Pro Gly 

280 

Lys Gin Ala Val 
295 

Val Asp Lys Phe 
310 



155 

Ala Glu Phe He 
170 

Gly Ser Lys He 
185 

Leu Pro Asn Val 

Gly Asp Gin Val 

220 

Glu Glu Lys Leu 
235 

Thr Lys Pro Ser 
250 

Gly Asp Cys He 
265 

Val Phe Ala Ala 

Val Ala Ala Ala 

300 

Leu Ser Gly Lys 
315 



160 

Ala Arg Phe Ala 
175 

Lys Ala Pro Pro 
190 

Glu He Leu Leu 
205 

Val Lys Gly He 

Leu Glu Val Asn 

240 

Val Asp Phe Leu 
255 

Val Val Asn Glu 
270 

Gly Asp Val Leu 
285 

Met Gly Cys Lys 

Lys Lys He Val 

320 



<210> 198 
<211> 315 
<212> PRT 

<213> Bacillus subtilis 



<400> 198 
Ser Glu Glu Lys 
1 

Gly Met Thr Ala 

20 

Met He Glu Arg 
35 

Val Glu Asn Tyr 
50 

Asn Lys Met Phe 
65 

Gly Asp He Lys 

Ala Gly Ser Lys 

100 

Ala Glu Tyr Lys 
115 

Arg Gly Val Ser 
130 

Lys Glu Leu Val 
145 

Val Tyr Leu Thr 

Asp Lys Leu Arg 

180 

Glu Lys Val Asp 
195 

Glu Asn Gly Lys 
210 

Glu Glu Ser Glu 
225 

Leu Pro Leu Ser 

Gly Tyr He Glu 

260 

Phe Ala Ala Gly 
275 



He Tyr Asp Val 
5 

Ala Val Tyr Thr 

Gly He Pro Gly 

40 

Pro Gly Phe Glu 
55 

Glu His Ala Lys 
70 

Glu Val He Asp 
85 

Glu Tyr Lys Ala 

Lys He Gly Val 

12 0 

Tyr Cys Ala Val 
13 5 

Val Val Gly Gly 
150 

Arg Phe Ala Ser 
165 

Ala Gin Ser He 

Phe Leu Trp Asn 

200 

Val Gly Asn Val 
215 

Phe Lys Thr Asp 
230 

Lys Pro Phe Glu 
245 

Thr Asn Asp Arg 

Asp He Arg Glu 

280 



He He He Gly 
10 

Ser Arg Ala Asn 
25 

Gly Gin Met Ala 

Ser He Leu Gly 

60 

Lys Phe Gly Ala 
75 

Gly Lys Glu Tyr 
90 

Arg Ala Val He 
105 

Pro Gly Glu Lys 

Cys Asp Gly Ala 

140 

Gly Asp Ser Ala 
155 

Lys Val Thr He 
17 0 

Leu Gin Ala Arg 
185 

Lys Thr Val Lys 

Thr Leu Val Asp 

220 

Gly Val Phe He 

- 235 
Asn Leu Gly He 
250 

Met Glu Thr Lys 
265 

Lys Ser Leu Arg 



Ala Gly Pro Ala 
15 

Leu Ser Thr Leu 
30 

Asn Thr Glu Asp 
45 

Pro Glu Leu Ser 

Glu Tyr Ala Tyr 

80 

Lys Val Val Lys 
95 

He Ala Ala Gly 
110 

Glu Leu Gly Gly 
125 

Phe Phe Lys Gly 

Val Glu Glu Gly 

160 

Val His Arg Arg 
175 

Ala Phe Asp Asn 
190 

Glu He His Glu 
205 

Thr Val Thr Gly 

■ 

Tyr He Gly Met 

240 

Thr Asn Glu Glu 
255 

Val Glu Gly He 
270 

Gin lie Val Thr 
285 
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Ala Thr Gly Asp Gly Ser lie Ala Ala Gin Ser Val Gin His Tyr Val 

290 " 295 300 

Glu Glu Leu Gin Glu Thr Leu Lys Thr Leu Lys 

305 310 315 



<210> 199 

<211> 326 

<212> PRT 

<213> Borrelia burgdorferi 



Met°Leu 9 Glu Phe Glu Thr lie Asp He Asn Leu Thr Lys Lys Lys Asn 

1 5 10 15 

Leu Ser Gin Lys Glu Val Asp Phe He Glu Asp Val He He Val Gly 

20 25 30 

Ser Gly Pro Ala Gly Leu Thr Ala Gly He Tyr Ser Val Met Ser Asn 

35 40 45 

Tyr Lys Ala Ala He Leu Glu Gly Pro Glu Pro Gly Gly Gin Leu Thr 

50 55 60 

Thr Thr Thr Glu Val Tyr Asn Tyr Pro Gly Phe Lys Asn Gly He Ser 
65 70 75 80 

Gly Arg Asn Leu Met Leu Asn Met Arg Glu Gin Val Val Asn Leu Gly 

85 90 95 

Ala Lys Thr Phe Pro Glu Thr Val Phe Ser He Lys Arg Lys Gly Asn 

100 105 11° 

He Phe Tyr Leu Tyr Thr Glu Asn Tyr He Tyr Lys Ser Lys Ala Val 

115 120 125 

He He Ala Val Gly Ser Lys Pro Lys Lys Leu Glu Thr Leu Lys Asn 
-, o ^ ' 1 71; 140 



Ser Gly Leu Phe Trp Asn Lys Gly He Ser Val Cys Ala He Cys Asp 
145 150 155 160 

Gly His Leu Phe Lys Gly Lys Arg Val Ala Val He Gly Gly Gly Asn 

165 170 I 75 

Thr Ala Leu Ser Glu Ser He Tyr Leu Ser Lys Leu Val Asp Lys Val 

180 185 190 

Tvr Leu He Val Arg Lys Asn Asn Leu Arg Ala He Ala Met Leu Arg 

J i«cr 900 205 



130 ' 135 m 140 

Leu Phe Trp Asn Lys Gly 

150 

Leu Phe Lys Gly Lys Arg 
165 

Leu Ser Glu Ser He Tyr 
180 

He Val Arg Lys Asn Asn 
195 200 
Asp Ser Val Ala Lys Leu Pro Asn He Glu He Leu Tyr Asn Ser Glu 

210 215 220 

Ala He Glu Val Asp Gly Lys Ser Ser Val Ser Ser Val Lys He Phe 

111 Lys Lys Asp Asn Val Val Tyr Glu Leu Glu Val Ser Ala Val Phe 

245 250 255 

Met Ala Val Gly Tyr Lys Pro Asn Thr Glu Phe Leu Lys Gly Phe Leu 

* 265 2 /0 

Gly Phe He Val Thr Lys 
2 80 

Phe Ser Cys Gly Asp Val 
295 

Ala Ala Ala Glu Gly Phe 
305 310 315 

Leu Gly Asn Phe Leu Lys 

3 25 



245 250 
Tyr Lys Pro Asn Thr 
2 60 265 - - 

Asp Leu Asp Glu Glu Gly Phe He Val Thr Lys Asp Val Val Lys Thr 

275 280 285 

Ser Val Asp Gly Val Phe Ser Cys Gly Asp Val Ser Asn Lys Leu Tyr 

290 295 300 

Ala Gin Ala He Thr Ala Ala Ala Glu Gly Phe He Ala Ser Val Glu 



<210> 200 
<211> 319 
<212> PRT 

<213> Buchnera aphidicola 
<400> 200 

Met Asp Lys Val Lys His Ser Lys 

1 5 
Ala Gly Tyr Thr Ala Ala He Tyr 

2 0 

Phe Leu He Thr Gly Thr Asn Lys 



He He He Leu Gly Ser Gly Pro 

10 15 
Ala Ala Arg Ala Asn Leu Asp Pro 
25 30 
Gly Gly Gin Leu Met Asn Thr Asn 
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35 

Glu lie Glu Asn 
50 

Leu Met Asn Arg 
65 

He Cys Asp Thr 

Leu He Gly Glu 

100 

Thr Gly Ala Asn 
115 

Lys Gly Lys Gly 
13 0 

Lys Asn Lys Glu 
145 

Glu Thr Leu Tyr 

Arg Gly He Asn 

180 

Lys Lys He Lys 
195 

Lys Asn He Leu 
210 

Gin Lys Asn Ser 
225 

Leu Phe Val Ala 

Lys Leu Lys Met 

260 

Gly Asn Tyr Thr 
275 

Val He Asp His 
290 

Cys Met Ala Ala 
305 



40 

Trp Pro Gly Asp 
55 

Met Tyr Lys His 
70 

Val He Ser Val 
85 

Asn Asn Lys Tyr 

Pro Arg Tyr Leu 

120 

Val Ser Thr Cys 
135 

Val Ala Val Val 
150 

Leu Ser Asn Phe 
165 

Phe Arg Ala Glu 

Ser Gin Lys He 

200 

Gly Asn Ser Ser 
215 

Lys Glu Lys Thr 

. 230 
He Gly Tyr Thr 
245 

Lys Asp Gly Tyr 

Gin Thr Ser He 

280 

Val Tyr Arg Gin 
295 

Leu Asp Ser Glu 
310 



Tyr Asn Lys He 

60 

Ala He Glu Leu 

75 

Asn Phe Lys Lys 
90 

Thr Ala Asp Ser 
105 

Gly Leu Gin Ser 

Ala Val Cys Asp 

140 

Gly Gly Gly Asn 
155 

Val Lys Lys Val 
170 

Lys He Leu Leu 
185 

He He Tyr Leu 

Gly Val Thr Ala 

220 

Glu Ser Lys He 
235 

Pro Asn Thr Asn 
250 

He Gin Val Thr 
265 

Pro Gly He Phe 

Ala He Thr Ser 

300 

Arg Tyr He Asn 
315 



45 

Ser Gly Ser Glu 

Lys Thr Lys Val 

80 

Asn Pro Phe Phe 
95 

Val He He Ala 
110 

Glu Ser Leu Phe 
125 

Gly Phe Phe Tyr 

Thr Ala He Glu 

160 

His Leu He His 
175 

Asp Arg Leu Glu 
190 

Asn Ser He Val 
205 

Leu Leu He Glu 

Gin Val Ser Gly 

240 

He Phe Val Asn 
255 

Arg Gin Glu His 
270 

Ala Ala Gly Asp 
285 

Ser Ala Ser Gly 
Ser Leu Val 



<210> 201 
<211> 319 
<212> PRT 
<213> Buchnera 



aphidicola 



<400> 201 
Met Glu Leu Lys 
1 

Ala Gly Tyr Thr 

20 

Leu Leu He Thr 
35 

Glu He Glu Asn 
50 

Leu Met Asn Arg 
65 

Val Tyr Asp Asn 

Leu Leu Gly Glu 

100 

Thr Gly Ala Asn 
115 

Lys Gly Lys Gly 
130 

Lys Asn Lys Glu 
145 

Glu Thr Leu Tyr 

Arg Arg Asn Asn 

180 



Asn His Lys Lys 
5 

Ala Ala He Tyr 

Gly He Asn Lys 

40 

Trp Pro Gly Asp 
55 

Met His Glu His 
70 

He He Ser Val 
85 

Tyr Asn Lys Tyr 

Pro Arg Tyr Leu 

120 

He Ser Thr Cys 
135 

He Ala Val Val 
150 

Leu Ser Asn Phe 
165 

Phe Lys Ala Glu 



He He He Leu 
10 

Ser Ser Arg Ala 
25 

Gly Gly Gin Leu 

Phe Lys Lys He 

60 

Ser Leu Lys Phe 
75 

Glu Phe Lys Lys 
90 

Thr Cys Asp Ala 
105 

Gly Leu Ser Ser 

Ala Val Cys Asp 

140 

Gly Gly Gly Asn 
155 

Val Lys Lys He 
170 

Lys He Leu He 
185 



Gly Ser Gly Pro 
15 

Asn Leu Asn Pro 
30 

Met Asn Thr Asn 
45 

Thr Gly Pro Glu 

Lys Thr Glu" He 

80 

Lys Pro Phe Phe 
95 

Val He He Ala 
110 

Glu Asn Lys Phe 
125 

Gly Phe Phe Tyr 

Thr Ala He Glu 

160 

Tyr Leu He His 
175 

ABp Arg Leu Leu 
190 



-112- 



WO 02/50289 



PCT/US01/50240 





lie 


Val 


Lys 


Thr 


Lys 


Lys 


Val 


195 










*i r\ r\ 

2 0 0 


Glu 


Asp 
210 


He 


Leu 


Gly 


Asn 


Asn 

21b 


Lys 


Asn 


Lys 


Asn 


Leu 


Lys 


Glu 


Lys 


Lys 


225 








230 






Leu 


Phe 


Val 


Ala 


He 
245 


Gly 


Tyr 


He 


Gin 


Leu 


Lys 


Met 
260 


Lys 


Asp 


Gly 


Tyr 


Gly 


Asn 


Tyr 


Thr 


Gin 


Thr 


Asn 


He 




275 










280 


Val 


He 


Asp 


His 


Val 


Tyr 


Arg 


Gin 




290 








295 




Cys 


Met 


Ala 


Ala 


Leu 


Asp 


Ser 


Glu 


305 










310 







He Leu His Leu Asn Ser Thr He 

205 

Gly Val Thr His Leu Leu He Lys 

220 

Lys Leu Lys He Ala Val Ser Gly 
235 240 
Pro Asn Thr Asp He Phe Thr Asp 

250 255 
He Lys He Lys Lys Gly Thr His 
265 270 
Pro Gly Val Phe Ala Ala Gly Asp 

285 

Ala He Thr Ser Ser Ala Ser Gly 

300 

Arg Tyr Leu Asn Ser Leu Ser 
315 



<210> 202 
<211> 312 
<212> PRT 

<213> Chlamydia muridarum 
<400> 202 

Met Thr His Val Lys Leu Ala He He Gly Ser Gly Pro Ala Gly Tyr 

15 10 15 

Thr Ala Ala He Tyr Ala Ser Arg Ala Leu Leu Thr Pro He Leu Phe 

20 25 30 

Glu Gly Phe Phe Ser Gly He Ala Gly Gly Gin Leu Met Thr Thr Thr 

35 40 45 

Glu Val Glu Asn Phe Pro Gly Phe Pro Gin Gly Val Leu Gly His Gin 

50 55 60 

Leu Met Glu Asn Met Lys Met Gin Ala Gin Arg Phe Gly Thr Gin Val 
65 70 75 80 

He Ala Lys Asp He Thr Ser Val Asp Phe Ser Val Arg Pro Phe Val 

85 90 95 

Leu Lys Ser Gly Glu Asp Thr Phe Thr Cys Asp Ala Cys He He Ala 

100 105 HO 

Thr Gly Ala Ser Ala Lys Arg Leu Ser He Pro Gly Ala Gly Asp Asn 

115 ~ 120 125 

Glu Phe Trp Gin Lys Gly Val Thr Ala Cys Ala Val Cys Asp Gly Ala 

130 135 140 

Ser Pro He Phe Arg Asp Arg Asp Leu Phe Val He Gly Gly Gly Asp 
145 150 155 160 

Ser Ala Leu Glu Glu Ala Met Phe Leu Thr Arg Tyr Gly Lys Arg Val 

165 170 175 

Phe Val Val His Arg Arg Asp Thr Leu Arg Ala Ser Lys Ala Met Val 

180 185 190 

Asn Lys Ala Gin Ala Asn Glu Lys He Val Phe Leu Trp Asn Ser Glu 

195 200 205 

Val Val Lys He Leu Gly Asp Ser Leu Val Arg Ser He Asp He Phe 

210 215 220 

Asn Asn Val Glu Lys Thr Thr Val Thr Met Glu Ala Ala Gly Val Phe 
225 230 235 240 

Phe Ala He Gly His Gin Pro Asn Thr Ala Phe Leu Gly Gly Gin Leu 

245 250 255 

Ser Leu Asp Glu Asn Gly Tyr He He Thr Glu Lys Gly Ser Ser Arg 

260 . 265 270 

Thr Ser Val Pro Gly Val Phe Ala Ala Gly Asp Val Gin Asp Lys Tyr 

275 280 285 

Tyr Arg Gin Ala He Thr Ser Ala Gly Ser Gly Cys Met Ala Ala Leu 

290 295 300 

Asp Ala Glu Arg Phe Leu Glu Lys 
305 310 



<210> 203 
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<211> 311 
<212> PRT 

<213> Chlamydia pneumoniae 



<400> 203 
Met He His Ser 
1 

Thr Ala Ala He 

20 

Glu Gly Phe Phe 
35 

Glu Val Glu Asn 
50 

Leu Met Asn Asn 
65 

Leu Ala Gin Asp 

Leu Lys Ser Lys 

100 

Thr Gly Ala Ser 
115 

Glu Phe Trp Gin 
130 

Ser Pro He Phe 
145 

Ser Ala Leu Glu 

Tyr Val Val His 

180 

Ala Arg Ala Gin 
195 

He Val Lys He 
210 

Asn Val Gin Thr 
225 

Phe Ala He Gly 

Thr Leu Asp Glu 

260 

Thr Ser Val Pro 
275 

Tyr Arg Gin Ala 
290 

Asp Ala Glu Arg 
305 



Arg Leu He He 
5 

Tyr Ala Ser Arg 

Ser Gly He Ser 

40 

Phe Pro Gly Phe 
55 

Met Lys Glu Gin 
70 

He He Ser Val 
85 

Glu Glu Thr Tyr 

Ala Lys Arg Leu 

120 

Lys Gly Val Thr 
135 

Lys Asn Lys Asp 
150 

Glu Ala Leu Tyr 
165 

Arg Arg Asp Lys 

r> 

Asn Asn Glu Lys 

200 

Ser Gly Asp Ser 
215 

Gin Glu He Thr 
230 

His Lys Pro Asn 
245 

Ser Gly Tyr He 

Gly Val Phe Ala 

280 

Val Thr Ser Ala 
295 

Phe Leu Gly 
310 



He Gly Ser Gly 
10 

Ala Leu Leu His 
25 

Gly Gly Gin Leu 

Pro Glu Gly He 

60 

Ala Val Arg Phe 
75 

Asp Phe Ser Val 
90 

Ser Cys Asp Ala 
105 

Glu He Pro Gly 

Ala Cys Ala Val 

140 

Leu Tyr Val He 
155 

Leu Thr Arg Tyr 
170 

Leu Arg Ala Ser 
185 

He Thr Phe Leu 

He Val Arg Ser 

220 

Thr Arg Glu Ala 
235 

Thr Asp Phe Leu 
250 

Val Thr Glu Lys 
265 

Ala Gly Asp Val 

Gly Ser Gly Cys 

300 



Pro Ser Gly Tyr 
15 

Pro Leu Leu Phe 
30 

Met Thr Thr Thr 
45 

Leu Gly Pro Lys 

Gly Thr Lys Thr 

80 

Arg Pro Phe He 
95 

Cys He He Ala 
110 

Ala Gly Asn Asp 
125 

Cys Asp Gly Ala 

Gly Gly Gly Asp 

160 

Gly Ser His Val 
175 

Lys Ala Met Glu 
190 

Trp Asn Ser Glu 
205 

Val Asp He Lys 

Ala Gly Val Phe 

240 

Gly Gly Gin Leu 
255 

Gly Thr Ser Lys 
270 

Gin Asp Lys Tyr 
285 

He Ala Ala Leu 



<210> 204 
<211> 312 
<212> PRT 

<213> Chlamydia trachomatis 

Met°Thr°Hi S Ala Lys Leu Val He He Gly Ser Gly Pro Ala Gly Tyr 

1 5 10 i- 

Thr Ala Ala He Tyr Ala Ser Arg Ala Leu Leu Thr Pro Val Leu Phe 

Glu Gly Phe Phe Ser Gly He Ala Gly Gly Gin Leu Met Thr Thr Thr 

35 40 ^ 

Glu Val Glu Asn Phe Pro Gly Phe Pro Glu Gly Val Leu Gly His Gin 

55 

Lys Thr Gin Ala Gin Ar< 
70 75 



Leu Met Asp Leu Met Lys Thr Gin Ala Gin Arg Phe Gly Thr Gin Val 

Leu Ser Lys Asp He Thr Ala Val Asp Phe Ser Val Arg Pro Phe Val 

85 90 yb 

Leu Lys Ser Gly Lys Glu Thr Phe Thr Cys Asp Ala Cys He He Ala 

100 105 110 
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Thx 


Gly 


Ala 


Ser 






115 




Glu 


Phe 


Trp 


Gin 




130 






Ser 


Pro 


He 


Phe 


145 








Ser 


Ala 


Leu 


Glu 


Phe 


Val 


Val 


His 








180 


Asn 


Lys 


Ala 


Gin 






195 




lie 


Val 


Lys 


He 




210 






Asn 


Asn 


Val 


Asp 


225 








Phe 


Ala 


He 


Gly 


Ala 


Leu 


Asp 


Glu 








260 


Thr 


Ser 


Val 


Pro 






275 




Tyr 


Arg 


Gin 


Ala 




290 






Asp 


Ala 


Glu 


Arg 


305 









Ala 


Lys 


Arg 


Leu 








12 0 


Lys 


Gly 


Val 


Thr 






135 




Arg 


Asp 


Lys 


Asp 




150 






Glu 


Ala 


Met 


Phe 


165 








Arg 


Arg 


Asp 


Thr 


Ala 


Asn 


Glu 


Lys 








200 


Ser 


Gly 


Asp 


Thr 






215 




Glu 


Thr 


Thr 


Thr 




230 






His 


Gin 


Pro 


Asn 


245 








Asn 


Gly 


Tyr 


He 


Gly 


Val 


Phe 


Ala 






280 


He 


Thr 


Ser 


Ala 






295 




Phe 


Leu 


Glu 


Asn 




310 







Ser He Pro Gly 

Ala Cys Ala Val 

140 

Leu Phe Val Val 
155 

Leu Thr Arg Tyr 
170 

Leu Arg Ala Ser 
185 

He Phe Phe Leu 

Leu Val Arg Ser 

220 

Thr Met Glu Ala 
235 

Thr Ala Phe Leu 
250 

He Thr Glu Lys 

Ala Gly Asp Val 

Gly Ser Gly Cys 

300 



Ala Gly Asp Asn 
125 

Cys Asp Gly Ala 

Gly Gly Gly Asp 

160 

Gly Lys Arg Val 
175 

Lys Val Met Val 
190 

Trp Asn Ser Glu 
205 

He Asp He Tyr 

Ala Gly Val Phe 

240 

Gly Gly Gin Val 
255 

Gly Ser Ser Arg 
270 

Gin Asp Lys Tyr 
285 

Met Ala Ala Leu 



<210> 205 
<211> 315 
<212> PRT 

<213> Clostridium litorale 



<400> 205 
Met Glu Asn Val 
1 

Leu Ala Ala Ala 

20 

Leu Glu Gly Met 

35 

Ala Asn Tyr Pro 
50 

Gly Arg Met Glu 
65 

Asp Asn He Val 

Gly Ala Lys Gly 

100 

Ala Ser Pro Lys 
115 

Lys Gly Val Ser 
13 0 

Met Glu Val Phe 
145 

Met Phe Leu Thr 

Ala Glu Leu Arg 

180 

Glu Lys Leu Asn 
195 

Asp Gly He Val 
210 

Val Thr Glu Phe 
225 

Val Phe He Gly 
Glu Leu Asp Glu 



Tyr Asp He Ala 
5 

Leu Tyr Gly Ala 

Lys Val Gly Gly 

40 

Gly Ser Val Pro 
55 

Glu Gin Val Glu 
70 

Asp Val Asp Phe 
85 

Glu Tyr Lys Ala 

Leu Ala Gly Cys 

120 

Tyr Cys Ala Thr 
135 

Val He Gly Gly 
150 

Lys Phe Ala Arg 
165 

Ala Ala Lys Ser 

Phe Met Trp Asn 

200 

Glu Ser Ala Val 
215 

Val Ala Pro Glu 
230 

Tyr Asp Pro Lys 
245 

Thr Gly Tyr He 



He He Gly Ser 
10 

Arg Ala Lys Met 
25 

Gin He Val He 

Glu Ala Thr Gly 

60 

Glu Phe Gly Ala 
75 

Thr Asp Lys He 
90 

Lys Ala Val He 
105 

Pro Gly Glu Lys 

Cys Asp Ala Asp 

14 0 

Gly Asp Thr Ala 
155 

Lys Val Thr He 
170 

He Gin Glu Lys 
185 

Thr Val He Glu 

Phe Lys Asn Arg 

22 0 

Glu Asp Gly Thr 
235 

Ser Ala Leu Val 
250 

Pro Thr Asp Asp 



Gly Pro Ala Gly 
15 

Lys Thr Leu Leu 
30 

Thr His Glu Val 
45 

Pro Ser Leu He 

Glu Arg Val Met 

80 

Lys Val Leu Lys 
95 

Val Ala Thr Gly 
110 

Glu Leu Thr Gly 
125 

Phe Phe Glu Asp 

Val Glu Glu Ala 

160 

Val His Arg Arg 
175 

Ala Phe Lys Asn 
190 

Glu He Lys Gly 
205 

Glu Thr Gly Glu 

Phe Gly He Phe 

240 

Glu Gly Lys Leu 
255 

Asn Met Lys Thr 
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260 

Asn Val Glu Gly Val Phe Ala Ala 
275 280 
Arg Gin Val Val Thr Ala Thr Ala 

290 295 
Ala Glu Lys Tyr lie Glu Glu Leu 
305 310 



265 270 
Gly Asp lie Arg Val Lys Ser Leu 

285 

Asp Gly Ala lie Ala Ala Val Gin 

300 

Phe Ala Glu 
315 



<210> 206 
<211> 321 
<212> PRT 

<213> Coxiella burnetii 



<400> 206 
Met Asn Lys Pro 
1 

Ala Gly Tyr Thr 

20 

lie Met lie Thr 
35 

Asp Val Ala Asn 
50 

Leu Leu Glu Arg 
■ 65 

Phe lie Phe Asp 

Leu Leu Gin Gly 

100 

Ala Thr Gly Ala 
115 

Tyr Met Gly Lye 
13 0 

Tyr Arg Ala Lys 
145 

Glu Glu Ala Leu 

His Arg Arg Asp 

180 

lie Lys Lys Val 
195 

lie Glu Glu Val 
210 

Lys His Val Lys 
225 

Phe lie Ala lie 

Leu Glu Met Asp 

260 

Gly Asn Ala Thr 
275 

Val Arg Gly Gin 
290 

Cys Met Pro Ala 

305 

Ala 



Gin His His Ser 
5 

Asp Ala lie Tyr 

Gly Met Glu Gin 

40 

Trp Pro Gly Glu 
55 

Met Gin Lys His 
70 

His lie Asn Lys 
85 

Asp Asn Ala Thr 

Ser Ala Arg Tyr 

120 

Gly Val Ser Ala 
135 

Lys Val Ala Val 
150 

Tyr Leu Ser His 
165 

Lys Leu Arg Ala 

Glu Glu Gly Lys 

200 

Leu Gly Asp Asp 
215 

Glu Glu Lys Thr 
230 

Gly His Asp Pro 
245 

Glu Ala Gly Tyr 

Ala Thr Asn lie 

280 

Leu Tyr Arg Gin 
295 

Leu Asp Ala Glu 
310 



Leu lie lie Leu 
10 

Val Ala Arg Ala 
25 

Gly Gly Gin Leu 

Ala Pro Gly Leu 

60 

Ala Gly Gly Ala 
75 

Pro Asp Leu Asn 
90 

Tyr Ser Cys Asp 
105 

Leu Gly Leu Pro 

Cys Ala Thr Cys 

140 

Val Gly Gly Gly 
155 

lie Ala Ser His 
170 

Glu Lys Met Leu 
185 

Val Ala lie Val 

Gin Gly Val Thr 

220 

Gin Asp Leu Thr 
235 

Asn Thr Lys lie 
250 

Leu Arg Ala Lys 
265 

Pro Gly Val Phe 

Thr lie Ala Ala 

300 

Arg Tyr Leu Asp 
315 



Gly Ser Gly Pro 
15 

Asn Leu Lys Pro 
30 

Met Thr Thr Thr 
45 

Gin Gly Pro Lys 

Leu Asn Thr Gin 

80 

Pro Arg Pro Phe 
95 

Ala Leu lie lie 
110 

Ser Glu Lys Pro 
125 

Asp Gly Phe Phe 

Asn Thr Ser Val 

160 

Val Thr Leu lie 
175 

Ser Ala Gin Leu 
190 

Trp Ser His Val 
205 

Gly Val His Leu 

lie Asp Gly Leu 

240 

Phe Lys Glu Gin 
255 

Ser Gly Leu Gin 
270 

Pro Ala Val Val 
285 

Ala Gly Met Gly 

Ser Leu Asn Gin 

320 



<210> 207 
<211> 320 
<212> PRT 

<213> Escherichia coli 
<400> 207 

Gly Thr Thr Lys His Ser Lys Leu Leu lie Leu Gly Ser Gly Pro Ala 
15 10 15 
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Gly Tyr Thr Ala 

2 0 

Leu He Thr Gly 
35 

Val Glu Asn Trp 
50 

Met Glu Arg Met 
65 

Phe Asp His lie 

Asn Gly Asp Asn 

100 

Gly Ala Ser Ala 
115 

Gly Arg Gly Val 
13 0 

Asn Gin Lys Val 
145 

Ala Leu Tyr Leu 

Arg Asp Gly Phe 

180 

Lys Val Glu Asn 
195 

Glu Val Thr Gly 
210 

Thr Gin Asn Ser 
225 

Val Ala He Gly 

Glu Leu Glu Asn 

260 

Ala Thr Gin Thr 
275 

Asp His He Tyr 
290 

Ala Ala Leu Asp 
305 



Ala Val Tyr Ala 

Met Glu Lys Gly 

40 

Pro Gly Asp Pro 
55 

His Glu His Ala 
70 

Asn Lys Val Asp 
85 

Gly Glu Tyr Thr 

Arg Tyr Leu Gly 

120 

Ser Ala Cys Ala 
135 

Ala Val He Gly 
150 

Ser Asn He Ala 
165 

Arg Ala Glu Lys 

Gly Asn He He 

200 

Asp Gin Met Gly 
215 

Asp Asn He Glu 
230 

His Ser Pro Asn 
245 

Gly Tyr He Lys 

Ser He Pro Gly 

280 

Arg Gin Ala lie 
295 

Ala Glu Arg Tyr 
310 



Ala Arg Ala Asn 

25 

Gly Gin Leu Thr 

Asn Asp Leu Thr 

60 

Thr Lys Phe Glu 
75 

Leu Gin Asn Arg 
90 

Cys Asp Ala Leu 
105 

Leu Pro Ser Glu 

Thr Cys Asp Gly 

14 0 

Gly Gly Asn Thr 
155 

Ser Glu Val His 
170 

He Leu He Lys 
185 

Leu His Thr Asn 

Val Thr Gly Val 

220 

Ser Leu Asp Val 
235 

Thr Ala He Phe 
250 

Val Gin Ser Gly 
265 

Val Phe Ala Ala 

Thr Ser Ala Gly 

300 

Leu Asp Gly Leu 
315 



Leu Gin Pro Val 
30 

Thr Thr Thr Glu 
45 

Gly Pro Leu Leu 

Thr Glu He He 

80 

Pro Phe Arg Leu 
95 

He He Ala Thr 
110 

Glu Ala Phe Lys 
125 

Phe Phe Tyr Arg 

Ala Val Glu Glu 

160 

Leu He His Arg 
175 

Arg Leu Met Asp 
190 

Arg Thr Leu Glu 
205 

Arg Leu Arg Asp 

Ala Gly Leu Phe 

240 

Glu Gly Gin Leu 
255 

He His Gly Asn 
270 

Gly Asp Val Met 
285 

Thr Gly Cys Met 

Ala Asp Ala Lys 

320 



<210> 208 

<211> 315 

<212> PRT 

<213> Eubacterium acidaminophilum 



<400> 208 
Met Glu Asn Val 
1 

Leu Ala Ala Ala 

20 

He Glu Gly Gin 
35 

Ala Asn Tyr Pro 
50 

Glu Arg Met Glu 
65 

Asp Lys He Val 

Gly Glu Lys Ala 

10 0 

Ala Ala Pro Arg 
115 

Lys Gly Val Ser 
130 

Met Glu Val Phe 
145 

Met Tyr Leu Ala 



Tyr Asp Leu Ala 
5 

Leu Tyr Gly Ala 

Lys Val Gly Gly 

40 

Gly Ser Val Arg 
55 

Glu Gin Ala Asn 
70 

Asp Val Asp Leu 
85 

Glu Tyr Lys Ala 

Leu Ala Gly Cys 

120 

Tyr Cys Ala Thr 
135 

Val Val Gly Gly 
150 

Lys Phe Ala Arg 



He He Gly Ser 
10 

Arg Ala Lys Met 
25 

Gin He Val He 

Glu Ala Thr Gly 

60 

Glu Phe Gly Ala 
75 

Asp Gly Lys He 
90 

Lys Ser Val He 
105 

Pro Gly Glu Gin 

Cys Asp Ala Asp 

140 

Gly Asp Thr Ala 
155 

Lys Val Thr He 



Gly Pro Ala Gly 
15 

Lys Thr He Met 
30 

Thr His Glu Val 
45 

Pro Ser Leu He 

Glu Lys Val Met 

80 

Lys Val He Lys 
95 

Leu Ala Thr Gly 
110 

Glu Leu Thr Gly 
125 

Phe Phe Glu Asp 

Val Glu Glu Ala 

160 

Val His Arg Arg 
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Asp 


Glu 


Leu Arg 




180 


Pro 


Lys 


Leu Asp 






195 


Asp 


Gly 


___ mm — — ^ 

lie Val 




210 




Thr 


Thr 


Glu Tyr 


225 






Val 


Phe 


lie Gly 


Thr 


Leu 


Asp Asp 






260 


Asn 


Val 


Glu Gly 






275 


Arg 


Gin 


Val Val 




290 




Ala 


Glu 


Lys Tyr 


305 







165 








Ala 


Ala 


Lys 


Ser 


Phe 


Met 


Trp 


Asn 








200 


Glu 


Ser 


Ala 


Val 






215 




Phe 


Ala 


Asn 


Glu 




230 






Tyr 


lie 


Pro 


Lys 


245 








Ala 


Gly 


Tyr 


He 


Val 


Phe 


Ala 


Ala 








280 


Thr 


Ala 


Cys 


Ala 






295 




Val 


Glu 


Ala 


Asn 




310 







170 

He Gin Glu Lys 
185 

Ser Ala He Glu 

Phe Lys Asn Leu 

220 

Glu Asp Gly Thr 
235 

Ser Asp Val Phe 
250 

He Thr Asp Asp 
265 

Gly Asp He Arg 

Asp Gly Ala He 

300 

Phe Glu Glu 
315 



175 

Ala Phe Lys Asn 
190 

Glu He Lys Gly 
205 

Val Thr Gly Glu 

Phe Gly He Phe 

240 

Lys Gly Lys He 
255 

Asn Met Lys Thr 
270 

Val Lys Ser Leu 
285 

Ala Ala Thr Gin 



<210> 209 
<2XX> 318 
<212> PRT 
<213> Haemophilus 



influenzae 



<400> 209 
Met Ser Asp He 
1 

Ala Gly Tyr Thr 

20 

Val Leu Val Thr 
35 

Glu He Glu Asn 
50 

Leu Met Gin Arg 
65 

Val Phe Asp His 

Leu Phe Gly Asp 

100 

Thr Gly Ala Ser 
115 

Lys Gly Arg Gly 
130 

Arg Asn Lys Pro 
145 

Glu Ala Leu Tyr 

Arg Arg Asp Ser 

180 

Lys Lys Val Glu 
195 

Asp Glu Val Leu 
210 

Asn Thr Lys Thr 
225 

Val Ala He Gly 

Glu Leu Asn Asn 

260 

Ala Thr Ala Thr 
275 

Asp His Asn Tyr 
290 

Ala Ala Leu Asp 
305 



Lys His Ala Lys 
5 

Ala Ala He Tyr 

Gly Leu Gin Gin 

40 

Trp Pro Gly Asp 
55 

Met Leu Gin His 
70 

He Asn Arg Val 
85 

Val Gin Asn Phe 

Ala Arg Tyr He 

120 

Val Ser Ala Cys 
135 

Val Gly Val He 
150 

Leu Ala Asn He 
165 

Phe Arg Ala Glu 

Glu Gly Lys He 

200 

Gly Asp Asn Met 
215 

Gly Glu Lys Glu 
230 

His Ser Pro Asn 
245 

Gly Tyr He Val 

Ser Val Glu Gly 

280 

Arg Gin Ala He 
295 

Ala Glu Arg Tyr 
310 



Leu Leu He Leu 
10 

Ala Ala Arg Ala 
25 

Gly Gly Gin Leu 

Phe Glu Met Thr 

60 

Ala Glu Lys Phe 
75 

Asp Leu Ser Ser 
90 

Thr Cys Asp Ala 
105 

Gly Leu Pro Ser 

Ala Thr CyB Asp 

140 

Gly Gly Gly Asn 
155 

Ala Ser Thr Val 
170 

Lys He Leu He 
185 

Val Leu His Thr 

Gly Val Thr Gly 

220 

Glu Leu Lys Leu 
235 

Thr Glu He Phe 
250 

Val Lys Ser Gly 
265 

Val Phe Ala Ala 

Thr Ser Ala Gly 

300 

Leu Asp Ala Gin 
315 



Gly Ser Gly Pro 
15 

Asn Leu Lys Pro 
30 

Thr Thr Thr Asp 
45 

Thr Gly Ser Gly 

Glu Thr Glu He 

80 

Arg Pro Phe Lys 
95 

Leu He He Ala 
110 

Glu Glu Asn Tyr 
125 

Gly Phe Phe Tyr 

Thr Ala Val Glu 

160 

His Leu lie His 
175 

Asp Arg Leu Tyr 
190 

Asp Arg Thr Leu 
205 

Leu Arg Leu Ala 

Asp Gly Leu Phe 

240 

Gin Gly Gin Leu 
255 

Leu Asp Gly Asn 
270 

Gly Asp Val Met 
285 

Thr Gly Cys Met 
Glu Ala 
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<210> 210 
<211> 311 
<2l2> PRT 

<213> Helicobacter pylori 



<400> 210 


























Ala 


Met 


lie 


Asp 


Cys 


Ala 


He 


He 


Gly 


Gly 


Gly 


Pro 


Ala 


Gly 


Leu 


Ser 


1 




5 










10 










15 




Gly 


Leu 


Tyr 


Ala 


Thr 


Arg 


Gly 


Gly 


Val 


Lys 


Asn 


Ala 


Val 


Leu 


Phe 


Glu 




20 










25 










30 






Lys 


Gly Met 


Pro 


Gly Gly 


Gin 


He 


Thr 


Gly 


Ser 


Ser 


Glu 


He 


Glu 


Asn 




35 










40 










45 




Gin 




Tyr 


Pro 


Gly Val 


Lys 


Glu 


Val 


Val 


Ser 


Gly Leu Asp 


Phe 


Met 


Pro 


50 










55 










60 






Thr 


Ala 


Trp 


Gin 


Glu 


Gin 


Cys 


Phe 


Arg 


Phe 


Gly 


Leu 


Lys 


His 


Glu 


Met 


65 








70 










75 










80 


He 


Gin 


Arg 


Val 


Ser 


Lys 


Lys 


Gly 


Ser 


His 


Phe 


Val 


He 


Leu 


Ala 


Glu 








85 










90 










95 




Asp 


Gly 


Lys 


Thr 


Phe 


Glu 


Ala 


Lys 


Ser 


Val 


He 


He 


Ala 


Thr 


Gly Gly 


100 










105 








Tyr 


110 


Gly 




Ser 


Pro 


Lys 


Arg 


Thr 


Gly 


He 


Lys 


Gly 


Glu 


Ser 


Glu 


Trp 


Lys 






115 






120 










12 5 








Gly 


Val 


Ser 


Thr 


Cys 


Ala 


Thr 


Cys 


Asp 


Gly 


Phe 


Phe 


Tyr 


Lys 


Asn 


Lys 


130 








135 










140 




Glu 




He 


Glu 


Val 


Ala 


Val 


Leu 


Gly 


Gly Gly Asp 


Thr 


Ala 


Val 


Glu 


Ala 


145 










150 










155 




_ _ • 

His 






160 


Tyr 


Leu 


Ala 


Asn 


He 


Cys 


Lys 


Lys 


Val 


Tyr 


Leu 


He 


Arg 


Arg 


Asp 








165 








170 










175 




Gly 


Phe 


Arg 


Cys 


Ala 


Pro 


He 


Thr 


Leu 


Glu 


His 


Ala 


Lys 


Asn 


Asn 


Ser 




180 










185 










190 






Lys 


He 


Glu 


Phe 


Leu 


Thr 


Pro 


Tyr 


Val 


Val 


Glu 


Glu 


He 


Lys 


Gly 


Asp 




195 










200 










205 






Glu 


Ala 


Ser 


Gly Val 


Ser 


Ser 


Leu 


Ser 


He 


Lys 


Asn 


Thr 


Ala 


Thr 


Asn 




210 










215 










220 










Lys 


Arg 


Glu 


Leu 


Val 


Val 


Pro 


Gly 


Leu 


Phe 


He 


Phe 


Val 


Gly 


Tyr 


Asp 


225 








230 










235 










240 


Val 


Asn 


Asn 


Ala 


Val 


Leu 


Lys 


Gin 


Glu 


Asp 


Asn 


Ser 


Met 


Leu 


Cys 


Glu 










245 








250 










255 




Cys 


Asp 


Glu 


Tyr 


Gly 


Ser 


He 


Val 


Val 


Asp 


Phe 


Ser 


Met 


Lys 


Thr 


Asn 




260 










265 










270 






Val 


Gin 


Gly 


Leu 


Phe 


Ala 


Ala 


Gly Asp 


He 


Arg 


He 


Phe 


Ala 


Pro 


Lys 






275 










2B0 










285 






Val 


Gin 


Val 


Val 


Cys 


Ala 


Ala 


Ser Asp Gly Ala Thr Ala 


Ala 


Leu 


Ser 




290 








295 










300 










He 


Ser 


Tyr 


Leu 


Glu 


His 


His 




















305 










310 























<210> 211 
<211> 311 
<212> PRT 

<213> Helicobacter pylori 
<400> 211 

Met He Asp Cys Ala He He Gly Gly Gly Pro Ala Gly Leu Ser Ala 

1 5 10 15 

Gly Leu Tyr Ala Thr Arg Gly Gly Val Lys Asn Ala Val Leu Phe Glu 

20 25 30 

Lys Gly Met Pro Gly Gly Gin He Thr Gly Ser Ser Glu He Glu Asn 

35 40 45 

Tyr Pro Gly Val Lys Glu Val Val Ser Gly Leu Asp Phe Met Gin Pro 

50 55 60 

Trp Gin Glu Gin Cys Phe Arg Phe Gly Leu Lys His Glu Met Thr Ala 
65 ~ 70 75 80 

Val Gin Arg Val Ser Lys Lys Asp Ser His Phe Val lie Leu Ala Glu 

85 - - 90 95 
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Asp 


Gly 


Lys 


Thr 








100 


Ser 


Pro 


Lys 


Arg 






115 




Gly Val 


Ser 


Thr 




130 






Glu 


Val 


Ala 


Val 


145 








Tyr 


Leu 


Ala 


Asn 


Gly 


Phe 


Arg 


Cys 








180 


Lys 


He 


Glu 


Phe 




195 




Ala 


Ser 


Gly 


Val 




210 






Lys 


Arg 


Glu 


Leu 


225 








Val 


Asn 


Asn 


Ala 


Cys 


Asp 


Glu 


Tyr 








260 


Val 


Gin 


Gly 


Leu 






275 




Gin 


Val 


Val 


Cys 




290 






He 


Ser 


Tyr 


Leu 


305 









Phe 


Glu 


Ala 


Lys 


Thr 


Gly 


He 


Lys 








120 


Cys 


Ala 


Thr 


Cys 




135 




Leu 


Gly 


Gly Gly 




150 






He 


Cys 


Lys 


Lys 


165 








Ala 


Pro 


He 


Thr 


Leu 


Thr 


Pro 


Tyr 








200 


Ser 


Ser 


Leu 


Ser 






215 




Val 


Val 


Pro 


Gly 




230 






Val 


Leu 


Lys 


Gin 


245 








Gly 


Ser 


lie 


Val 


Phe 


Ala 


Ala 


Gly 








280 


Ala 


Ala 


Ser 


Asp 






295 




Glu 


His 


His 






310 







Ser Val He He 
105 

Gly Glu Ser Glu 

Asp Gly Phe Phe 

140 

Asp Thr Ala Val 
155 

Val Tyr Leu He 
170 

Leu Glu His Ala 
185 

Val Val Glu Glu 

He Lys Asn Thr 

220 

Phe Phe He Phe 
235 

Glu Asp Asn Ser 
250 

Val Asp Phe Ser 
265 

Asp He Arg He 

Gly Ala Thr Ala 

3 00 



Ala Thr Gly Gly 
110 

Tyr Trp Gly Lys 
125 

Tyr Lys Asn Lys 

Glu Glu Ala He 

160 

His Arg Arg Asp 
175 

Lys Asn Asn Asp 
190 

He Lys Gly Asp 
205 

Ala Thr Asn Glu 

Val Gly Tyr Asp 

240 

Met Leu Cys Lys 
255 

Met Lys Thr Asn 
270 

Phe Ala Pro Lys 
285 

Ala Leu Ser Val 



<210> 212 
<211> 319 
<212> PRT 

<213> Listeria monocytogenes 



<400> 212 
Met Ala Ser Glu 
1 

Pro Ala Gly Met 

20 

Thr Leu Met He 
35 

Ala Glu Val Glu 
50 

Leu Ser Asp Lys 
65 

Ala Tyr Gly Asp 

Val Thr Ala Gly 

100 

Thr Gly Ala Glu 
115 

Ser Gly Arg Gly 
13 0 

Lys Asn Arg Glu 
145 

Glu Gly Thr Tyr 

Arg Arg Asp Lys 

180 

Lys Asp Glu Lys 

195 

Val Gly Asp Gly 
210 

Asp Gly Ser Glu 
225 

Gly Leu Val Pro 



Glu Lys He Tyr 
5 

Thr Ala Ala Leu 

Glu Arg Gly Val 

40 

Asn Tyr Pro Gly 
55 

Met Leu Ser Gly 
70 

He Lys Glu Val 
85 

Ser Lys Thr Tyr 

His Arg Lys Leu 

120 

Val Ser Tyr Cys 
135 

Leu He Val Val 
150 

Leu Thr Arg Tyr 
165 

Leu Arg Ala Gin 

Val Asp Phe He 

200 

Lys Lys Val Thr 
215 

Ser He Met Pro 
230 

Leu Thr. Lys Ala 



Asp Val He He 
10 

Tyr Thr Ser Arg 
25 

Pro Gly Gly Gin 

Phe Asp Ser He 

60 

Ala Lys Gin Phe 
75 

Val Asp Gly Lys 
90 

Lys Ala Arg Ala 
105 

Gly Ala Ala Gly 

Ala Val Cys Asp 

140 

Gly Gly Gly Asp 
155 

Ala Asp Lys Val 
170 

Gin He Leu Gin 
185 

Trp Asn Ser Thr 

Gly Ala Lys Leu 

220 

Val Asp Gly Val 
235 

Phe Leu Asn Leu 



He Gly Ala Gly 
15 

Ala Asp Leu Asp 
30 

Met Val Asn Thr 
45 

Leu Gly Pro Asp 

Gly Ala Glu Tyr 

80 

Glu Phe Lys Thr 
95 

He He He Ala 
110 

Glu Glu Glu Leu 
125 

Gly Ala Phe Phe 

Ser Ala Val Glu 

160 

Thr He Val His 
175 

Asp Arg Ala Phe 
190 

Val Glu Glu He 

205 

Val Ser Thr Val 

Phe He Tyr Val 

240 

Gly He Thr Asp 
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245 

Asp Glu Gly Tyr lie Val Thr Asp 

260 

Gly lie Phe Ala Ala Gly Asp Val 
275 280 
Val Thr Ala Thr Gly Asp Gly Gly 

290 295 
Tyr Val Glu Glu Leu Lys Glu Ser 
305 310 



250 255 
Glu Glu Met Arg Thr Asn Leu Pro 
265 270 
Arg Ala Lys Ser Leu Arg Gin lie 

2 85 

Leu Ala Gly Gin Asn Ala Gin Lys 

300 

Leu Glu Ala Glu Ala Ala Lys 
315 



<210> 213 
<211> 315 
<212> PRT 
<213> Mycoplasma 



genitalium 



<400> 213 
Met Leu Lys Val 
1 

Asp Leu Val lie 

20 

Tyr Gly Lys Arg 
35 

Pro Gly Gly Lys 
50 

Phe Lys Thr He 
65 

Leu Leu Ala Phe 

Asp His Leu Asn 

100 

Val Phe Ser Lys 
115 

Leu Gly He Glu 
13 0 

Cys Ala He Cys 
145 

Val Gly Gly Gly 

He Ala Lys Thr 

180 

Asp Ala Leu Leu 
195 

His Leu Asn Ala 
210 

Thr Val Lys Leu 
225 

He Asp Cys Leu 

Val Leu Asp Leu 

260 

Asp Glu Asn Met 
275 

Cys Arg Ser Lys 
2 90 

Val Thr Ala Val 
305 



Asn Ala Asp Phe 
5 

Val Gly Ala Gly 

Ala Asn Leu Asn 

40 

He Val Lys Thr 
55 

Thr Gly Pro Glu 
70 

Glu Pro Val Val 

85 

Asp Thr Phe He 

Thr Val He Tyr 

120 

Lys Glu Asp Tyr 
13 5 

Asp Ala Ala Leu 
150 

Asn Ser Ala He 
165 

Val His Leu He 

Val Glu Lys Leu 

200 

Thr Val Lys Gin 
215 

Ala Ser Thr Val 
230 

Phe Pro Tyr He 
245 

Lys Leu Asn Leu 

Gin Thr Asn He 

280 

Ser Phe Arg Gin 
295 

Leu Lys Val Arg 
310 



Leu Thr Lys Asp 
10 

Pro Ala Gly He 
25 

Leu Ala He He 

Asn He Val Glu 

60 

Leu Gly Leu Glu 
75 

Phe Tyr Asn Asn 
90 

Leu Tyr Leu Asp 
105 

Ala Thr Gly Met 

Phe Tyr Gly Lys 

14 0 

Tyr Lys Gly Lys 
155 

Gin Glu Ala He 
170 

His Arg Arg Glu 
185 

Lys Lys lie Ser 

Leu He Gly Gin 

220 

Asp Lys Ser Glu 
235 

Gly Phe Glu Ser 
250 

Asp Gin Asn Gly 
265 

Lys Gly Phe Tyr 

He Ala Thr Ala 

300 

Asp Asp He 
315 



Gin Val He Tyr 
15 

Ala Ser Ala He 
30 

Glu Gly Asn Thr 
45 

Asn Tyr Pro Gly 

Met Tyr Asn His 

80 

Leu He Lys He 
95 

Asn Lys Thr Thr 
110 

Glu Glu Arg Lys 
125 

Gly He Ser Tyr 

Thr Val Gly Val 

160 

Tyr Leu Ser Ser 
175 

Val Phe Arg Ser 
190 

Asn Val Val Phe 
205 

Glu Lys Leu Gin 

Ser Glu He Ala 

240 

Asn Asn Lys Pro 
255 

Phe He Leu Gly 
270 

Val Ala Gly Asp 

285 

He Ser Asp Gly 



<210> 214 
<211> 458 
<212> PRT 

<213> Mycobacterium leprae 

Me^As^Thr Thr Pro Ser Ala His Glu Thr He His Glu Val He Val 
1 5 10 15 
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lie 


Gly 


Ser 


Gly 


Pro 


Ala 


Gly 


Tyr 


Thr 








20 










25 


Ala 


Gin 


Leu 


Thr 


Pro 


Leu 


Val 


Phe 


Glu 






35 










40 




Leu 


Met 


Thr 


Thr 


Thr 


GlU 


Val 


Glu 


Asn 




r— r\ 

5 0 










c c 






lie 


Thr 


Gly 


Pro 


Glu 


Leu 


Met 


Asp 


Asp 


65 










/ U 








T-lt_ 

Phe 


Gly 


Ala 


Glu 


Leu 
85 


Arg 


Tnr 


Glu 


Asp 


Gly 


Pro 


lie 


Lys 


Ser 


Val 


TT_ 1 

Val 


Thr 


Ala 






100 










105 


Arg 


— — i 

Ala 


Val 


lie 


Leu 


Ala 


Met 


Gly Thr 


• 


115 










120 




Pro 


Gly 


Glu 


Gin 


Glu 


Leu 


Leu 


Gly Arg 




130 










13 5 






Cys 


Asp 


Gly 


Ser 


Phe 


Phe 


Arg 


Gly Gin 


145 










150 








Gly 


Asp 


Ser 


Ala 


Met 


Glu 


Glu 


Ala 


Leu 






165 










Ser 


Val 


Thr 


Leu 


Val 


His 


Arg 


Arg 


Asp 








180 










185 


Met 


Leu 


Gly Arg 


Ala 


Arg 


Asn 


Asn 


Asp 






195 










200 




Hxs 


Thr 


Val 


Val 


Ala 


Val 


Asn 


Gly Tyr 




210 










1 1 C 

21b 






Leu 


Arg 


Asn 


Thr 


(Ml 

Thr 


Thr 


Gly 


Glu 


Glu 


225 










230 








Val 


Phe 


Val 


Ala 


lie 


Gly 


His 


Glu 


Pro 










245 








Val 


Val 


Asp 


lie 


Asp 


Pro 


Asp 


Gly 


Tyr 








260 










265 


Thr 


Ser 


Thr 


Ser 


Met 


Asp 


Gly 


Val 


Phe 






275 










280 




Arg 


Thr 


Tyr 


Arg 


Gin 


"A 1 _ 

Ala 


T 1 _ 

He 


Thr 


Ala 




290 










2 95 






Ala 


lie 


Asp 


Ala 


Glu 


Arg 


Trp 


Leu 


Ala 


"4 f\ t~ 

305 










310 








Asn 


Glu 


Thr 


Thr 


Glu 


Glu 


Thr 


Gly Asp 










325 










Asp 


Trp 


Ser 


Thr 


Ala 


Met 


Thr 


Asp 


Ala 








340 










345 


Glu 


Val 


Thr Asp 


Ala 


Ser 


Phe 


Phe 


Ala 






355 










360 




Pro 


Val 


Leu 


Val 


Asp 


Phe 


Trp 


Ala 


Thr 




370 










375 






Val 


Ala 


Pro 


Val 


Leu 


Glu 


Glu 


He 


Ala 


385 










390 








Thr 


Val 


Ala 


Lys 


Leu 


Asp 


Val 


Asp 


Thr 










405 










Phe 


Gin 


Val 


Val 


Ser 


He 


Pro 


Thr 


Met 








420 










425 


Pro 


Val 


Lys 


Arg 


lie 


Val 


Gly 


Ala 


Lys 






435 










440 




Asp 


Leu 


Ser Asp 


Val 


Val 


Pro 


Asn 


Leu 



450 455 



<210> 215 
<211> 315 
<212> PRT 

<213> Mycoplasma pneumoniae 
<400> 215 

Met Leu Lys Val Lys Ser Asp Phe Leu 

1 5 ^ 

Asp Val Ala He Val Gly Ala Gly Pro 
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r*l tr 






D U 
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vai 




lieu 


-rtx y 


q n 












Glu 
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Val 
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inr 
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14 0 
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He 
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Val 
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IJ.6 
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v^iy 
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Arg 
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Arg 


1/0 
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Tl 
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TV 
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mr 
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O 1 c 
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Asp 


250 














Val 
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Oly 


Arg 


inr 
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Asp 
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"3 1 C 
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J ^ u 
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Ser 


Thr 
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Lys 


Asn 


Ala 


Gly 
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Tnr 
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J s u 






Asp 


Val 
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oer 


T\ n 

Asn 


Lys 
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Trp 


Cys 
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Pro 


Cys 


Lys 


Met 






380 










Ser 


Glu 


Gin 


Arg 


Asn 


Gin 


Leu 




395 










400 


Asn 


Pro 


Glu 


Met 


Ala 


Arg 


Glu 


410 










415 




He 


Leu 


Phe 


Gin 


Gly 


Gly 


Gin 










430 






Gly 


Lys 


Ala 


Ala 


Leu 


Leu 


Arg 






a a <=; 

ft *4 O 








Asn 














Thr 


Lys 


Asp 


Gin 
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He 


Tyr 
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Ala 


Gly 


He 


Ala 


Ala 


Gly 


He 
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Tyr 


Gly 


Lys 


Arg 






35 




Pro 


Gly 


Gly 


Lys 




50 






Tyr 


Lys 


Ser 


He 


65 








Leu 


lie 


Asp 


Leu 


Asp 


Lys 


Ala 


Ala 








100 


Val 


Phe 


Ala 


Lys 






115 




Leu 


Gly 


Val 


Ala 




130 






Cys 


Ala 


He 


Cys 


145 








Val 


Gly 


Gly Gly 


Met 


Ala 


Lys 


Thr 








180 


Asp 


Glu 


Thr 


Ala 




195 




His 


Leu 


Asn 


Tyr 




210 






Gly 


He 


Val 


Leu 


225 








Leu 


Asn 


Cys 


Val 


Val 


Glu 


His 


Leu 








260 


Asn 


Glu 


Gin 


Met 






275 




Cys 


Arg 


Ser 


Lys 




290 






lie 


He 


Ala 


Val 


305 









Ala Asn Leu Asn 

40 

Val Val Lys Thr 
55 

Thr Gly Pro Asp 
70 

Glu Pro Thr Phe 
85 

Asp Thr Phe He 

Thr Val He Tyr 

120 

Lys Glu Asp His 
135 

Asp Gly Ser Leu 
150 

Asn Ser Ala He 
165 

Val His Leu He 

Leu Asn Lys Leu 

200 

Thr Val Lys Glu 
215 

Gin Asn Thr Leu 
230 

Phe Pro Tyr He 
245 

Asn Leu Lys Leu 

Glu Thr Ser Leu 

2B0 

His Phe Arg Gin 
295 

Leu Thr He Arg 
310 



25 

Leu Ala He He 

Asn He Val Glu 

60 

Leu Gly Leu Glu 
75 

Phe Tyr Ala Asn 
90 

Leu Tyr Leu Asp 
105 

Ala Thr Gly Met 

Phe Tyr Gly Lys 

140 

Tyr Lys Asp Gin 
155 

Gin Glu Ala Leu 
170 

His Arg Arg Glu 
185 

Arg Asn Leu Pro 

Leu Leu Gly Asn 

220 

Asp His Ser Thr 
235 

Gly Phe Glu Ser 
250 

Asp Pro Gin Gly 
265 

Lys Gly Leu Phe 

.lie Gly Thr Ala 

300 

Asp Val Leu 
315 



30 

Glu Gly Ser Thr 
45 

Asn Tyr Pro Gly 

Met Tyr Asn His 

80 

Leu He Lys Leu 
95 

Asp Lys Thr Val 
110 

Leu Glu Arg Lys 
125 

Gly He Ser Tyr 

Val Val Gly Val 

160 

Tyr Leu Ala Ser 
175 

Gly Phe Arg Ala 
190 

Asn Val Val Phe 
205 

Asn Thr Leu Asn 

Lys Gin He Asp 

240 

He Thr Lys Pro 

255 

Phe Leu He Thr 
270 

Ala Ala Gly Asp 
285 

He Asn Asp Gly 



<210> 216 
<211> 311 
<212> PRT 

<213> Mycobacterium smegmatis 



<400> 216 
Met Ser Thr Ser 
1 

Pro Ala Gly Tyr 

20 

Pro Leu Val Phe 
35 

Thr Glu Val Glu 
50 

Glu Leu Met Asp 
65 

Leu Arg Met Glu 

Thr Val Val Val 

100 

Ala Met Gly Ala 
115 

Leu Thr Gly Met 
130 

Phe Arg Asp Gin 
145 

Glu Glu Ala Thr 



Gin Thr Val His 
5 

Thr Ala Ala He 

Glu Gly Thr Gin 

40 

Asn Tyr Pro Gly 
55 

Gin Met Arg Glu 
70 

Asp Val Asp Ala 
85 

Gly Asp Glu Thr 

Ala Ala Arg His 

120 

Gly Val Ser Thr 
135 

Asp He Val Val 
150 

Phe Leu Thr Arg 
165 



Asp Val He He 
10 

Tyr Ala Ala Arg 
25 

Phe Gly Gly Ala 

Phe Arg Glu Gly 

60 

Gin Ala Leu Arg 
75 

Val Gin Leu Glu 
90 

His Gin Ala Arg 
105 

Leu Gly Val Pro 

Cys Ala Thr Cys 

14 0 

Val Gly Gly Gly 
155 

Phe Ala Arg Ser 
170 



He Gly Ser Gly 
15 

Ala Gin Leu Lys 
30 

Leu Met Thr Thr 
45 

He Thr Gly Pro 

Phe Arg Ala Asp 

80 

Gly Pro Val Lys 
95 

Ala Val He Leu 
110 

Gly Glu Glu Ala 
125 

Asp Gly Phe Phe 

Asp Ser Ala Met 

160 

Val Thr Leu He 
17 5 
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Val 
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245 








250 
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Val 
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Leu 




260 










265 








270 




Gin 


Asp 
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His 
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Arg 


275 










280 
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Ser 
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Cys Ala 


Ala 


Ser 


lie 


Asp 
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Glu 




290 








295 








300 
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<210> 217 
<211> 335 
<212> PRT 

<213> Mycobacterium tuberculosis 



<400> 217 
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Asp 


1 














10 








15 




val 


lie 


va J. 


Tl - 

± _L.e 


laJLy 


O ■v* 
l5CX 




Pro 


Ala 


Gly 


Tyr Thr 


Ala 


Ala 


Leu 


Tyr 








2 0 






25 








30 




Phe 


Ala 


Ala 


Arg 


Ala 


Gin 


Leu 


Ala 


Pro 


Leu 


Val 


Phe Glu 


Gly 


Thr 


Ser 






35 










40 








45 




Gly 


Phe 


Gly Gly 


Ala 


Leu 


Met 


Thr 


Thr 


Thr 


Asp 


Val 


Glu Asn 


Tyr 


Pro 




50 










55 








60 






Glu 


Gin 


Arg 


Asn 


Gly 


lie 


Thr 


Gly 


Pro 


Glu 


Leu 


Met 


Asp Glu 


Met 


Arg 


65 








70 










75 








80 


Ala 


Leu 


Arg 


Phe 


Gly 


Ala 


Asp 


Leu Arg 


Met 


Glu Asp 


Val 


Glu 


Ser 


Val 








85 










90 








95 




Ser 


Leu 


His 


Gly 
100 


Pro 


Leu 


Lys 


Ser 


Val 
105 


Val 


Thr Ala 


Asp 


Gly 
110 


Gin 


Thr 


His 


Arg 


Ala 


Arg 


Ala 


Val 


He 


Leu 


Ala 


Met 


Gly Ala 


Ala 


Ala 


Arg 


Tyr 




115 








120 








125 








Leu 


Gin 


Val 


Pro 


Gly 


Glu 


Gin 


Glu Leu Leu Gly Arg Gly Val 


Ser 


Ser 




130 








135 








140 






Ala 


Val 


Cys 


Ala 


Thr 


Cys 


Asp 


Gly 


Phe 


Phe 


Phe 


Arg 


Asp Gin 


Asp 


He 


145 






150 










155 








160 


lie 


Gly 


Gly 


Gly 


Asp 


Ser 


Ala 


Met 


Glu 


Glu 


Ala Thr 


Phe 


Leu 


Thr 


Arg 




165 










170 








175 




Phe 


Ala 


Arg 


Ser 


Val 


Thr 


Leu 


Val 


His 


Arg 


Arg Asp 


Glu 


Phe 


Arg 


Ala 






180 










185 








190 




Phe 


Ser 


Lys 


lie 


Met 


Leu 


Asp 


Arg 


Ala 


Arg 


Asn 


Asn Asp 


Lys 


He 


Arg 




195 










200 








205 






Thr 


Leu 


Thr 
210 


Asn 


His 


Thr 


Val 


Val 
215 


Ala 


Val 


Asp 


Gly Asp 
220 


Thr 


Thr 


Val 


Gly 


Leu 


Arg 


Val 


Arg 


Asp 


Thr 


Asn 


Thr 


Gly Ala Glu Thr 


Thr 


Leu 


Pro 


225 








230 










235 






Gly 


240 


Val 


Thr 


Gly 


Val 


Phe 


Val 


Ala 


He 


Gly 


His 


Glu Pro 


Arg 


Ser 


Leu 








245 










250 








255 




Val 


Arg 


Glu 


Ala 


lie 


Asp 


Val 


Asp 


Pro 


Asp 


Gly Tyr 


Val 


Leu 


Val 


Gin 






260 










265 








270 






Gly Arg 


Thr 


Thr 


Ser 


Thr 


Ser 


Leu 


Pro 


Gly 


Val Phe 


Ala 


Ala 


Gly Asp 






275 










280 








285 








Leu 


Val 


Asp 


Arg 


Thr 


Tyr 


Arg 


Gin 


Ala 


Val 


Thr Ala Ala Gly Ser Gly 




290 






295 








300 








Ala 


Cys 


Ala 


Ala 


Ala 


He 


Asp 


Ala 


Glu Arg 


Trp 


Leu Ala 


Glu 


His 


Ala 


305 










310 










315 








320 


Thr 


Gly 


Glu 


Ala 


Asp 


Ser 


Thr 


Asp 


Ala 


Leu 


He Gly Ala 


Gin 


Arg 
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<210> 218 
<211> 334 
<212> PRT 

<213> Neurospora crassa 



<400> 218 



Met 


His 


Ser 


Lvs 


Val 


Val 


He 


He 


Gly 


Ser 


Gly 


Pro 


Ala 


Ala 


His 


Thr 


1 






5 










10 










15 




Ala. 


Ala 


He 


Tvr 


Leu 


Ala 


Arg 


Ala 


Glu 


Leu 


Lvs 


Pro 


Val 


Leu 


Tvr 


Glu 








20 










25 










30 






Glv 


Phe 


Met 


Ala 


Asn 


Gly 


He 


Ala 


Ala 


Gly 


Gly 


Gin 


Leu 


Thr 


Thr 


Thr 






35 










40 










45 








Thr 


Glu 


He 


Glu 


Asn 


Phe 


Pro 


Gly 


Phe 


Pro 


Asp 


Gly 


He 


Met 


Gly 


Gin 




50 










55 










60 










Glu 


Leu 


Met 


Asp 


Lys 


Met 


Lys 


Ala 


Gin 


Ser 


Glu 


Arg 


Phe 


Gly 


Thr 


Gin 


65 










70 










75 










80 


lie 


He 


Ser 


Glu 


Thr 


Val 


Ala 


Lys 


Val 


ABp 


Leu 


Ser 


Ala 


Arg 


Pro 


Phe 










85 










90 










95 




Lys 


Tyr 


Ala 


Thr 


Glu 


Trp 


Ser 


Pro 


Glu 


Glu 


Tyr 


His 


Thr 


Ala 


Asp 


Ser 






100 










105 










110 






lie 


He 


Leu 


Ala 


Thr 


Gly Ala 


Ser 


Ala Arg 


Arq 


Leu 


His 


Leu 


Pro 


Gly 






115 










120 










125 








Glu 


Glu 


Lys 


Tyr 


Tro 


Gin 


Asn 


Glv 


He 


Ser 


Ala 


Cys 


Ala 


Val 


Cys 


Asp 




130 










135 










140 










Glv 


Ala 


Val 


Pro 


He 


Phe 


Arg 


Asn 


Lys 


His 


Leu 


Val 


Val 


He 


Gly 


Glv 


145 










150 










155 










160 


Gly 


Asp 


Ser 


Ala 


Ala 


Glu 


Glu 


Ala 


Met 


Tyr 


Leu 


Thr 


Lys 


Tyr 


Gly 


Ser 










165 










170 










175 




His 


Val 


Thr 


Val 


Leu 


Val 


Arg 


Lys 


Asp 


Lys 


Leu 


Arg 


Ala 


Ser 


Ser 


He 








180 










185 










190 






Met 


Ala 


His 


Arg 


Leu 


Leu 


Asn 


His 


Glu 


Lys 


Val 


Thr 


Val 


Arg 


Phe 


Asn 






195 










200 










205 








Thr 


Val 


Gly Val 


Glu 


Val 


Lys 


Gly 


Asp 


Asp 


Lys 


Gly 


Leu 


Met 


Ser 


His 




210 










215 










220 










Leu 


Val 


Val 


Lys 


Asp 


Val 


Thr 


Thr 


Gly 


Lys 


Glu 


Glu 


Thr 


Leu 


Glu 


Ala 


225 










230 










235 










240 


Asn 


Gly 


Leu 


Phe 


Tyr 


Ala 


He 


Gly 


His 


Asp 


Pro 


Ala 


Thr 


Ala 


Leu 


Val 










245 










250 










255 




Lys 


Gly 


Gin 


Leu 


Glu 


Thr 


Asp 


Ala 


Asp 


Gly 


Tyr 


Val 


Val 


Thr 


Lys 


Pro 








260 










265 










270 






Gly 


Thr 


Thr 


Leu 


Thr 


Ser 


Val 


Glu 


Gly 


Val 


Phe 


Ala 


Ala 


Gly Asp 


Val 






275 










280 










285 








Gin 


Asp 


Lys 


Arg 


Tyr 


Arg 


Gin 


Ala 


He 


Thr 


Ser 


Ala 


Gly 


Thr 


Gly 


Cys 




290 










295 










300 










Met 


Ala 


Ala 


Leu 


Asp 


Ala 


Glu 


Lys 


Phe 


Leu 


Ser 


Glu 


His 


Glu 


Glu 


Thr 


305 








310 








315 










320 


Pro 


Ala 


Glu 


His 


Arg 


Asp 


Thr 


Ser 


Ala 


Val 


Gin 


Gly Asn 


Leu 














325 










330 















<210> 219 
<211> 333 
<212> PRT 

<213> Penicillium chrysogenum 
<400> 219 

Val His Ser Lys Val Val He He Gly Ser Gly Ala Gly Ala His Thr 

1 5 10 15 

Ala Ala He Tyr Leu Ser Arg Ala Glu Leu Gin Pro Val Leu Tyr Glu 

20 ~ 25 3 0 

Gly Met Leu Ala Asn Gly Thr Ala Ala Gly Gly Gin Leu Thr Thr Thr 

35 40 45 

Thr Asp Val Glu Asn Phe Pro Gly Phe Pro Ser Gly He Gly Gly Ala 
50 55 ^ 60 
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Glu 




M<=>h 




Ac? r-i 


Met 


Ara Ala 


65 








70 




lie 


lie 

JL. -1— 


Thr 


Glu 


Thr 


He 


Ser Lvs 










85 






Lvs 


Met 


Trp 


Thr 


Glu 


Trn 


Asn Asd 








100 








Th"r 


Ala 


Asp 


Ala 


Val 

V U —I— 


He 


lie Ala 






115 








120 


Asn 

iiD xx 




Pro 


Gly 


Glu 

W J- Li 


Glu 


Thr Tvr 




130 










135 


Ala 


V d JL 


Cys 


Asp 


Glv 


Ala 

AJ. CI. 


Va 1 Pro 


145 

-L. -_J 










150 




Val 


T 1 e» 


Gly Gly 


Gl v 




p» -»- Ala 










165 








TVr 


Gly 


Ser 


i_J C. -I_ 


Val 


Thr Val 

1 1 1 J- V CX JL 








180 








Ala 




Asn 


He 


Met 


Ala 


Asn Ara 

M -**X- 






195 








200 


Val 


Aro 


Phe 


Asn 




Val 


Ala Thr 




210 










215 


Asn 


Glv 


Leu 


Met 


Thr 


His 


Leu Ara 

X^J X IX 


225 








23 0 




Glu 


Glu 


Val 


Val 


Glu 


Ala 


Asn Gly 










245 






Pro 


Ala 


Ser Gly 


Leu 


Val 


Lys Gly 








260 








Tyr 


lie 


He 


Thr 


Lys 


Pro 


Gly Thr 






275 








280 


Phe 


Ala 


Cys 


Gly 


Asp 


Val 


Gin Asp 




290 










295 


Ser 


Ala 


Gly 


Ser 


Gly 


Cys 


Val Ala 


305 










310 




Ala 


Glu 


Thr 


Glu 


Thr 


His 


Gin Glu 



325 



<210> 220 

<211> 310 

<212> PRT 

<213> Rickettsia prowazekii 



<400> 220 



Met 


Lys 


He 


Thr 


Thr 


Lys 


Val 


Leu 


1 








5 








Leu 


Ser 


Ala 


Ala 


He 


Tyr 


Thr 


Ala 








20 








He 


Asn 


Gly 


Met 


Gin 


Pro 


Gly Gly 






35 










40 


Glu 


Asn 
50 


Tyr 


Pro 


Gly 


Phe 


Ala 
55 


Glu 


Glu 


Gin 


Met 


Ser 


Met 


Gin 


Ala 


Lys 


65 










70 






Asp 


Tyr 


Val 


Glu 


Arg 
85 


Val 


Asp 


Leu 


Thr 


Gly 


Thr 


Gly 
100 


Asn 


Glu 


Tyr 


Glu 


Gly 


Ala 


Glu 
115 


Ser 


Lys 


Trp 


Leu 


Gly 
120 


Gly 


Phe 
130 


Gly 


Val 


Ser 


Ser 


Cys 
13 5 


Ala 


Asn 


Gin 


Glu 


He 


Val 


Val 


Val 


Gly 


145 










150 






Ala 


Leu 


Tyr 


Leu 


Thr 


Asn 


His 


Ala 








165 








Arg 


Asn 


Ser 


Phe 
180 


Arg 


Ala 


Glu 


Lys 


Asn 


Pro 


Lys 


He 


Ser 


Val 


He 


Trp 



Gin 


Ser 


Glu 

VJ J* *-X 

75 


Ara* 


Phe 


Gly 


Thr 


Glu 
80 


Leu Asp 


Leu 


Ser 


Ser Arg 


Pro 


Phe 




90 










95 




Asp 


Glu 


Gly 


Ser 


Glu 


Pro 


Val 


Ara 


105 










110 






Thr 


Gly 


Ala 


Asn 

X X*-S X X 


Ala 
125 

*X» X*« 


Arg 


Ara 


Leu 


Trp 


Gin 


Asn 

X X *— * XX 


Glv 
140 


He 


Ser 


Ala 


Cvs 


He 


Phe 


Ara 
155 


Asn 


Lys 


Pro 


Leu 


Tvr 
160 


Ala 


Glu 
170 


Glu 

WX> 


Ala 


Met 


Phe 


Leu 

X.I "*«^ lA 

175 


Ala 


Leu 


Val 




Lvs 


Asp 


Lys 


Leu 


Ara 


185 










190 






Leu 


Leu 


Ala 


His 


Pro 
205 


Lys 


Cys 


Lys 


Glu 


Val 


He 


Glv 
220 


Glu 


Asn 


Lys 


Pro 


Val 


Lys 


Asp 
235 


Val 


Leu 


Ser 


Asn 


Ala 
240 


Leu 


Phe 
250 


Tvr 


Ala 


Val 


Gly 


His 
255 


Asp 


Gin 


Val 


Glu 


Leu 


Asp Asp 


Glu 


Gly 


265 










270 






Ser 


Phe 


Thr 


Asn 


Val 


Glu 


Gly Val 










285 








Lys 


Arg 


Tyr 


Arg 
300 


Gin 


Ala 


He 


Thr 


Ala 


Leu 


Glu 


Ala 


Glu 


Lys 


Phe 


He 






315 








320 


Ala 


Lys 
330 


Pro 


Val 


Leu 









He 


He 


Gly 


Ser Gly 


Pro 


Ala 


Gly 




10 










15 




Arg 


Ser 


Ala 


Leu 


Lys 


Pro 


He 


Leu 


25 










30 






Gin 


Leu 


Thr 


Met 


Thr 


Thr 


Asp 


Val 










45 








Thr 


He 


Gin 


Gly 
60 


Pro 


Trp 


Leu 


Met 


Asn 


Val 


Gly 


Thr 


Glu 


He 


He 


Ser 






75 










80 


Ser 


Lys 


Arg 


Pro 


Phe 


Lys 


He 


Phe 




90 










95 




Ala Asp 


Ser 


He 


He 


He 


Cys 


Thr 


105 










110 






He 


Ala 


Ser 


Glu 


Gin 


Glu 


Phe 


Arg 










125 








He 


Cys 


Asp 


Gly 


Phe 


Phe 


Phe 


Lys 








140 










Gly Gly 


Asn 


Ser 


Ala 


Leu 


Glu 


Glu 






155 










160 


Asn 


Lys 


Val 


Thr 


Val 


Val 


His 


Arg 




170 










175 




He 


Leu 


Gin 


Asp 


Arg 


Leu 


Phe 


Lys 


185 










190 






Asp 


His 


He 


He 


Asp 


Glu 


He 


Val 
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195 200 205 



Gly 


Ser 


Asn Lys 


Pro 


Lys 


Ala 


Val 


Thr 


Gly Val Lys 


lie 


Gin 


Asn 


Val 




210 








215 






220 










Tyr 


Thr 


Asn Glu 


lie 


Asn 


Leu 


Val 


Asn 


Cys Ser Gly Val 


Phe 


He 


Ala 


225 








230 








235 








240 


lie 


Gly 


His Ala 


Pro 


Asn 


Thr 


Ala 


Leu 


Phe Lys Gly 


Gin 


lie 


Ala 


He 








245 










250 






255 




Asp 


Asp 


Asp Asn 


Tyr 


lie 


Val 


Thr 


Gin 


Ser Gly Ser 


Thr 


Arg 


Thr 


Asn 






260 










265 






270 






Val 


Glu 


Gly Val 


Phe 


Ala 


Ala Gly Asp Val Gin Asp 


Lys 


He 


Tyr 


Arg 






275 








280 






285 








Gin 


Ala 


Val Thr 


Ala 


Ala 


Ala 


Ser 


Gly 


Cys Met Ala 


Ala 


Leu 


Glu 


Val 




290 








295 




300 










Ala 


Lys 


Phe Leu 


Asn 


Lys 


















305 








310 



















<210> 221 
<211> 322 
<212> PRT 

<213> Schizosaccharomyces pombe 



<400> 221 



Met 


Thr 


His 


Asn 


Lys 


Val 


Val 


He 


He 


Gly 


Ser Gly 


Pro 


Ala 


Gly 


His 


1 








5 










10 










15 




Thr 


Ala 


Ala 


He 


Tyr 


Leu 


Ala 


Arg 


Gly 


Glu 


Leu 


Lys 


Pro 


Val 


Met 


Tyr 








20 










25 










3-0 






Glu 


Gly 


Met 


Leu 


Ala 


Asn 


Gly 


He Ala Ala Gly Gly Gin Leu 


Thr 


Thr 






35 










40 










45 








Thr 


Thr 


Asp 


Val 


Glu 


Asn 


Phe 


Pro 


Gly 


Phe 


Pro 


Asp 


Gly 


He 


Asn 


Gly 




50 










55 










60 










Thr 


Thr 


Leu 


Thr 


Glu 


Asn 


Phe 


Arg 


Ala 


Gin 


Ser 


Leu 

• 


Arg 


Phe 


Gly 


Thr 


65 










70 










75 










80 


Glu 


lie 


He 


Thr 


Glu 


Thr 


Val 


Ser 


Lye 


Leu 


Asp 


Leu 


Ser 


Ser 


Arg 


Pro 










85 










90 










95 




Phe 


Lys 


Tyr 


Trp 


Leu 


Glu 


Gly 


Ala 


Glu 


Glu 


Glu 


Glu 


Pro 


His 


Thr 


Ala 








100 










105 










110 






Asp 


Ser 


Val 


He 


Leu 


Ala 


Thr 


Gly Ala 


Ser 


Ala 


Arg 


Arg 


Leu 


His 


He 






115 










120 










125 








Thr Gly 


Glu 


Asp 


Thr 


Tyr 


Trp 


Gin 


Ala 


Gly 


He 


Ser 


Ala 


Cys 


Ala 


Val 




130 










135 










140 










Cys 


Asp 


Gly Ala 


Val 


Pro 


He 


Tyr 


Arg 


Asn 


Lys 


Pro 


Leu 


Ala 


Val 


Val 


145 










150 










155 










160 


Gly Gly Gly Asp 


Ser 


Ala 


Ala 


Glu 


Glu 


Ala 


Gin 


Phe 


Leu 


Thr 


Lys 


Tyr 










165 










170 










175 




Gly 


Ser 


Lys 


Val 


Tyr 


Val 


Leu 


Val 


Arg 


Arg 


Asp 


Lys 


Leu 


Arg 


Ala 


Ser 








180 










185 










190 






Pro 


He 


Met 


Ala 


Lys 


Arg 


Leu 


Leu 


Ala 


Asn 


Pro 


Lys 


Val 


Glu 


Val 


Leu 






195 










200 










205 








Trp 


Asn 


Thr 


Val 


Ala 


Glu 


Glu 


Ala 


Gin 


Gly Asp 


Gly 


Lys 


Leu 


Leu 


Asn 




210 










215 










220 










Asn 


Leu 


Arg 


He 


Lys 


Asn 


Thr 


Asn 


Thr 


Asn 


Glu 


Val 


Ser 


Asp 


Leu 


Gin 


225 










230 










235 








240 


Val 


Asn 


Gly Leu 


Phe 


Tyr 


Ala 


He 


Gly 


His 


He 


Pro 


Ala 


Thr 


Lys 


Leu 










245 










250 










255 




Val 


Ala 


Glu 


Gin 


He 


Glu 


Leu 


Asp 


Glu Ala Gly Tyr 


He 


Lys 


Thr 


He 








260 










265 










270 






Asn Gly Thr Pro 


Arg 


Thr 


Ser 


He 


Pro 


Gly 


Phe 


Phe 


Ala 


Ala 


Gly 


Asp 






275 










280 










285 








Val 


Gin Asp Lys 


Val 


Phe 


Arg 


Gin 


Ala 


He 


Thr 


Ser 


Ala 


Gly 


Ser 


Gly 




290 










295 










300 










Cys 


Gin 


Ala 


Ala 


Leu 


Leu 


Ala 


Met 


His 


Tyr 


Leu 


Glu 


Glu 


Leu 


Glu 


Asp 


305 










310 








315 










320 



Thr Asp 
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<210> 222 
<211> 321 
<212> PRT 

<213> StreptomyceB clavuligerus 
<400> 222 

Ser Asp Val Arg Asn Val He He He Gly Ser Gly Pro Ala Gly Tyr 

15 10 15 

Thr Ala Ala Leu Tyr Thr Ala Arg Ala Ser Leu Gin Pro Leu Val Phe 

20 25 30 

Glu Gly Ala Val Thr Ala Gly Gly Ala Leu Met Asn Thr Thr Asp Val 

35 40 45 

Glu Asn Phe Pro Gly Phe Arg Asp Gly He Met Gly Pro Asp Leu Met 

50 55 60 

Asp Asn Met Arg Ala Gin Ala Glu Arg Phe Gly Ala Glu Leu He Pro 
65 * 70 75 80 

Asp Asp Val Val Ser Val Asp Leu Thr Gly Asp He Lys Thr Val Thr 

85 90 95 

Asp Ser Ala Gly Thr Val His Arg Ala Lys Ala Val He Val Thr Thr 

100 105 HO 

Gly Ser Gin His Arg Lys Leu Gly Leu Pro Arg Glu Asp Ala Leu Ser 

115 120 125 

Gly Arg Gly Val Ser Trp Cys Ala Thr Cys Asp Gly Phe Phe Phe Lys 

13 0 135 I 40 

Asp Gin Asp He Val Val Val Gly Gly Gly Asp Thr Ala Met Glu Glu 
145 150 155 160 

Ala Thr Phe Leu Ser Arg Phe Ala Lys Ser Val Thr He Val His Arg 

165 170 175 

Arq Asp Ser Leu Arg Ala Ser Lys Ala Met Gin Asp Arg Ala Phe Ala 

180 185 190 

Asp Pro Lys He Ser Phe Ala Trp Asn Ser Glu Val Ala Thr He His 

195 200 205 

Gly Glu Gin Lys Leu Thr Gly Leu Thr Leu Arg Asp Thr Lys Thr Gly 

210 215 220 

Glu Thr Arg Glu Leu Ala Ala Thr Gly Leu Phe He Ala Val Gly His 
225 230 235 240 

Asp Pro Arg Thr Glu Leu Phe Lys Gly Gin Leu Asp Leu Asp Asp Glu 

245 250 255 

Gly Tyr Leu Lys Val Ala Ser Pro Ser Thr Arg Thr Asn Leu Thr Gly 

260 265 270 

Val Phe Ala Ala Gly Asp Val Val Asp His Thr Tyr Arg Gin Ala He 

275 * 280 285 

Thr Ala Ala Gly Thr Gly Cys Ser Ala Ala Leu Asp Ala Glu Arg Tyr 

290 ~ 295 300 

Leu Ala Ala Leu Ala Asp Ser Glu Gin He Ala Glu Pro Ala Pro Ala 
305 310 315 320 

Val 



<210> 223 
<211> 321 
<212> PRT 

<213> Streptomyces coelicolor 

<400> 223 , 

Ser Asp Val Arg Asn Val He He He Gly Ser Gly Pro Ala Gly Tyr 

1 ~ 5 10 15 

Thr Ala Ala Leu Tyr Thr Ala Arg Ala Ser Leu Lys Pro Leu Val Phe 

20 * 25 30 

Glu Gly Ala Val Thr Ala Gly Gly Ala Leu Met Asn Thr Thr Glu Val 

35 40 45 

Glu Asn Phe Pro Gly Phe Gin Asp Gly He Met Gly Pro Glu Leu Met 

50 55 60 

Asp Asn Met Arg Ala Gin Ala Glu Arg Phe Gly Ala Glu Leu He Pro 
65 70 75 80 

Asp Asp Val Val Ala Val Asp Leu Ser Gly Glu He Lys Thr Val Thr 
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Asp Thr Ala Gly 

100 

Gly Ser Gin His 
115 

Gly Arg Gly Val 
130 

Asp Gin Asp lie 
145 

Ala Thr Phe Leu 

Arg Asp Thr Leu 

180 

Asp Pro Lys lie 
195 

Gly Asp Gin Lys 
210 

Glu Leu Ser Asp 
225 

Asp Pro Arg Thr 

Gly Tyr Leu Lys 

2 60 

Val Phe Gly Ala 
275 

Thr Ala Ala Gly 
290 

Leu Ala Ala Leu 

305 

Val 



85 








Thr 


Val 


His 


Arg 


Arg 


Lys 


Leu Gly 








i "> n 
xz u 


Ser 


Trp 


Cys 


ax a 






135 




Ala 


Val 


lie 


Gly 




150 






Ser 


Arg 


Phe 


Ala 


ID J 








Arg 


Ala 


Ser 


Lys 


Ser 


Phe 


Val 


Trp 








200 


Leu 


Ala 


Gly 


Leu 






215 




Leu 


Pro 


Val 


Thr 




230 






Glu 


Leu 


Phe 


Lys 


245 








Val 


Asp 


Ala 


Pro 


Gly Asp 


Val 


Val 








280 


Thr 


Gly 


Cys 


Ser 






295 




Ser 


Asp 


Glu 


Asp 




310 







90 

Ala Lys Ala Val 
105 

Leu Pro Asn Glu 

Thr Cys Asp Gly 

140 

Gly Gly Asp Thr 
155 

Lys Ser Val Thr 
170 

Ala Met Gin Glu 
185 

Asp Ser Glu Val 

Lys Leu Arg Asn 

220 

Gly Leu Phe lie 
235 

Gly Gin Leu Asp 
250 

Ser Thr Arg Thr 
265 

Asp His Thr Tyr 

Ala Ala Val Asp 

300 

Lys Ala Glu Pro 
315 



95 

lie Val Thr Thr 
110 

Asp Ala Leu Ser 
125 

Phe Phe Phe Lys 

Ala Met Glu Glu 

160 

He Val His Arg 
175 

Arg Ala Phe Ala 
190 

Ala Glu Val Gin 
205 

Val Lys Thr Gly 

Ala lie Gly His 

240 

Leu Asp Pro Glu 
255 

Asn Leu Thr Gly 
270 

Arg Gin Ala- lie 
285 

Ala Glu Pro Phe 

Glu Lys Thr Ala 

320 



<210> 224 
<211> 307 
<212> PRT 
<213> Treponema 

<400> 224 
Met Glu Thr Asp 
1 

Leu Ser Ala Ala 

20 

He Glu Ser Lys 

35 

Glu Asn Tyr Pro 
50 

Glu Asn Met Lys 
65 

Glu Glu Val Thr 

Thr Gly Thr Gly 

100 

Ala Glu His Arg 
115 

Arg Gly Val Ser 
13 0 

Lys His Val Val 
145 

Leu Val Leu Ser 

Asp Thr Leu Arg 

180 

Pro His He Ala 
195 

Glu Thr Lys Val 
210 



pallidium 



Tyr Asp Val He 
5 

Gin Tyr Ala Cys 

Ala His Gly Gly 

40 

Gly Tyr Ala Thr 
55 

Lys Gin Ala Val 
70 

Thr He Gly Lys 
85 

Ala Tyr Thr Ala 

Lys Met Gly He 

120 

Tyr Cys Ala Thr 
135 

Val He Gly Gly 
150 

Arg Leu Thr Asp 
165 

Ala Gin Lys Ala 

Val Gin Trp Asn 

200 

Ser Ser Val Leu 
215 



He Val Gly Ala 
10 

Arg Ala Asn Leu 
25 

Gin Ala Leu Leu 

Pro He Ser Gly 

60 

Ala Phe Gly Ala 
75 

Arg Asp Ser Val 
90 

Met Ser Val He 
105 

Pro Gly Glu Ser 

Cys Asp Gly Pro 

140 

Gly Asp Ala Ala 
155 

Arg Val Thr Met 
170 

He Ala Glu Arg 
185 

Thr Thr Leu Glu 

Leu Lys Asp Val 

220 



Gly Ala Ala Gly 
15 

Arg Thr Leu Val 
30 

He Asp Ser Leu 
45 

Phe Glu Tyr Ala 

Gin He Ala Tyr 

80 

Phe His He Thr 
95 

Leu Ala Thr Gly 
110 

Glu Phe Leu Gly 
125 

Phe Phe Arg Asn 

Cys Asp Glu Ser 

160 

He His Arg Arg 
175 

Thr Leu Lys Asn 
190 

Ala Val Arg Gly 
205 

Lys Thr Gly Glu 
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Thr Arg Glu Leu 

225 

Pro He Thr Gly 

lie Val Thr Asp 

260 

Ala Gly Asp Val 
275 

Ser Asp Gly Ala 
290 

Leu Gin Asn 
305 



Ala Cys Asp Ala 
230 

Leu Leu Pro Asp 
245 

Asp Glu Met Arg 

Arg Ala Lys Ser 

280 

Leu Ala Ala His 
295 



Val Phe Phe Phe 

235 

Ala Glu Lys Asp 
250 

Thr Ser Val Glu 
265 

Phe Arg Gin Val 

Ala Ala Ala Ser 

300 



He Gly Met Val 

240 

Ser Thr Gly Tyr 
255 

Gly He Phe Ala 
270 

He Thr Ala Thr 
285 

Tyr He Asp Thr 



<210> 225 
<211> 45 
<212> PRT 

<213> Vibrio fischeri 

<400> 225 , 

Met Asn Val Lys His Ser Lys Leu Leu He Leu Gly Ser Gly Pro Ala 

1 5 10 15 

Glv Tyr Thr Ala Ala Val Tyr Ala Ala Arg Ala Asn Leu Asn Pro Val 

20 25 30 

Met He Thr Gly Met Gin Gin Gly Gly Gin Leu Thr Asn 
35 .40 45 



<210> 226 
<211> 318 
<212> PRT 

<213> Saccharomyces 



cerevisiae 



<400> 226 
Val His Asn Lys 
1 

Ala Ala He Tyr 

20 

Gly Met Met Ala 
35 

Thr Glu He Glu 
50 

Glu Leu Met Asp 
65 

He He Thr Glu 

Lys Leu Trp Thr 

100 

Ala He He Leu 
115 

Gly Glu Glu Thr 
130 

Asp Gly Ala Val 
145 

Gly Gly Asp Ser 

Ser Lys Val Phe 

180 

He Met Gin Lys 
195 

Asn Thr Val Ala 
210 

Leu Arg He Lys 
225 

Ser Gly Leu Phe 
Ala Gly Gin Val 



Val Thr He He 
5 

Leu Ala Arg Ala 

Asn Gly He Ala 

40 

Asn Phe Pro Gly 
55 

Arg Met Arg Glu 
70 

Thr Val Ser Lys 
85 

Glu Phe Asn Glu 

Ala Thr Gly Ala 

120 

Tyr Trp Gin Lys 
135 

Pro He Phe Arg 
150 

Ala Cys Glu Glu 
165 

Met Leu Val Arg 

Arg Ala Glu Lys 

200 

Leu Glu Ala Lys 
215 

Asn Thr Lys Lys 
230 

Tyr Ala He Gly 
245 

Asp Thr Asp Glu 



Gly Ser Gly Pro 
10 

Glu He Lys Pro 
25 

Ala Gly Gly Gin 

Phe Pro Asp Gly 

60 

Gin Ser Thr Lys 
75 

Val Asp Leu Ser 
90 

Asp Ala Glu Pro 
105 

Ser Ala Lys Arg 

Gly He Ser Ala 

14 0 

Asn Lys Pro Leu 
155 

Ala Gin Phe Leu 
170 

Lys Asp His Leu 
185 

Asn Glu Lys He 

Gly Asp Gly Lys 

220 

Asn Glu Glu Thr 
235 

His Thr Pro Ala 
250 

Ala Gly Tyr He 



Ala Ala His Thr 
15 

He Leu Tyr Glu 
30 

Leu Thr Thr Thr 
45 

Leu Thr Gly Ser 

Phe Gly Thr Glu 

80 

Ser Lys Pro Phe 
95 

Val Thr Thr Asp 
110 

Met His Leu Pro 
125 

Cys Ala Val Cys 

Ala Val He Gly 

160 

Thr Lys Tyr Gly 
175 

Arg Ala Ser Thr 
190 

Glu He Leu Tyr 
205 

Leu Leu Asn Ala 

Asp Leu Pro Val 

240 

Thr Lys He Val 
255 

Lys Thr Val Pro 
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260 

Gly Ser Ser Leu 
275 

Gin Asp Ser Lys 
290 

Met Ala Ala Leu 
305 



Thr Ser Val Pro 

280 

Tyr Arg Gin Ala 
295 

Asp Ala Glu Lys 
310 



265 

Gly Phe Phe Ala 

He Thr Ser Ala 

300 

Tyr Leu Thr Ser 
315 



270 

Ala Gly Asp Val 
285 

Gly Ser Gly Cys 
Leu Glu 



<210> 227 
<211> 342 
<212> PRT 

<213> Saccharomyces 



cerevisiae 



<r400> 227 
Met He Lys His 
1 

Ser Lys Ser Val 

20 

Gly Ser Gly Pro 
35 

Glu Met Lys Pro 
50 

Ala Gly Gly Gin 
65 

Phe Pro Glu Ser 

Gin Ser Ala Lys 

100 

Val Asp Leu Ser 
115 

Asp Ala Glu Pro 
130 

Ser Ala Lys Arg 
145 

Gly He Ser Ala 

Asn Lys Pro Leu 

180 

Ala Glu Phe Leu 
195 

Lys Asp His Phe 
210 

Asn Pro Asn He 
225 

Gly Asp Gly Lys 

Asn Val Glu Asn 

260 

His Ser Pro Ala 
275 

Thr Gly Tyr He 
290 

Gly Phe Phe Ala 
305 

Val Thr Ser Ala 

Tyr Leu Ser Ala 

340 



He Val Ser Pro 

5 

Leu Ser Arg Met 

Ala Ala His Thr 

40 

Thr Leu Tyr Glu 
55 

Leu Thr Thr Thr 
70 

Leu Ser Gly Ser 
85 

Phe Gly Thr Asn 

Ser Lys Pro Phe 

120 

Val Thr Thr Asp 
135 

Met His Leu Pro 
150 

Cys Ala Val Cys 
165 

Ala Val He Gly 

Thr Lys Tyr Ala 

200 

Arg Ala Ser Val 
215 

He Val Leu Phe 
230 

Leu Leu Asn Met 
245 

Asp Leu Glu Val 

Thr Asp He Val 

280 

Lys Thr Val Pro 
295 

Ala Gly Asp Val 
310 

Gly Ser Gly Cys 
325 

Gin Glu 



Phe Axg Thr Asn 
10 

He His His Lys 
25 

Ala Ala He Tyr 

Gly Met Met Ala 

6 0 

Thr Asp He Glu 

75 

Glu Leu Met Glu 
90 

He He Thr Glu 
105 

Arg Leu Trp Thr 

Ala He He Leu 

140 

Gly Glu Glu Thr 
155 

Asp Gly Ala Val 
17 0 

Gly Gly Asp Ser 
185 

Ser Lys Val Tyr 

He Met Gin Arg 

220 

Asn Thr Val Ala 
235 

Leu Arg He Lys 
250 

Asn Gly Leu Phe 
265 

Lys Gly Gin Val 

Gly Ser Ser Leu 

300 

Gin Asp Ser Arg 
315 

He Ala Ala Leu 
330 



Phe Val Gly He 
15 

Val Thr He He 
30 

Leu Ala Arg Ala 
45 

Asn Gly He Ala 

Asn Phe Pro Gly 

80 

Arg Met Arg Lys 
95 

Thr Val Ser Lys 
110 

Glu Phe Asn Glu 
125 

Ala Thr Gly Ala 

Tyr Trp Gin Gin 

160 

Pro He Phe Arg 
175 

Ala Cys Glu Glu 
190 

He Leu Val Arg 
205 

Arg He Glu Lys 

Leu Glu Ala Lys 

240 

Asn Thr Lys Ser 
255 

Tyr Ala He Gly 
270 

Asp Glu Glu Glu 
285 

Thr Ser Val Pro 

Tyr Arg Gin Ala 

320 

Asp Ala Glu Arg 
335 



<210> 228 
<211> 499 
<212> PRT 
<213> Bos taurus 

<400> 228 



-131- 



WO 02/50289 



PCTYUS01/50240 



Met Asn Gly Ser 
1 

He lie Gly Gly 

20 

Lys Tyr Asp Lys 
35 

Leu Gly Thr Arg 
50 

He Pro Lys Lys 
65 

Arg Asp Ser Arg 

Asp Trp Glu Arg 

100 

Asn Trp Gly Tyr 
115 

Asn Ala Tyr Gly 
130 

Asn Lys Gly Lys 
145 

Thr Gly Glu Arg 

Cys He Ser Ser 

180 

Thr Leu Val Val 
195 

Leu Ala Gly He 
210 

Leu Arg Gly Phe 

225 

Gin Glu His Gly 

Glu Gin He Glu 

260 

Ser Thr Asp Ser 
275 

Leu Ala He Gly 
290 

Val Gly Val Lys 
305 

Glu Glu Gin Thr 

Glu Gly Lys Leu 

340 

Leu Ala Gin Arg 
355 

Asn Val Pro Thr 
370 

Leu Ser Glu Glu 
385 

Val Tyr His Ser 

Asp Asn Asn Lys 

420 

Glu Arg Val Val 
435 

Thr Gin Gly Phe 
450 

Leu Asp Ser Thr 
465 

Thr Leu Ser Val 
Cys Cys Gly 



Lys Asp Leu Pro 

5 

Gly Ser Gly Gly 

Lys Val Met Val 

40 

Trp Gly Leu Gly 
55 

Leu Met His Gin 
70 

Asn Tyr Gly Trp 
85 

Met Thr Glu Ala 

Arg Val Ala Leu 

120 

Glu Phe Val Gly 
135 

Glu Lys He Tyr 
150 

Pro Arg Tyr Leu 
165 

Asp Asp Leu Phe 

Gly Ala Ser Tyr 

200 

Gly Leu Asp Val 
215 

Asp Gin Asp Met 
230 

He Lys Phe He 
245 

Ala Gly Thr Pro 

Asp Gin Thr He 

280 

Arg Asp Ala Cys 
295 

He Asn Glu Lys 
310 

Asn Val Pro Tyr 
325 

Glu Leu Thr Pro 

Leu Tyr Gly Gly 

360 

Thr Val Phe Thr 
375 

Lys Ala Val Glu 
390 

Tyr Phe Trp Pro 
405 

Cys Tyr Ala Lys 

Gly Phe His Val 

440 

Ala Ala Ala Leu 
455 

He Gly He His 
470 

Thr Lys Arg Ser 
485 



Glu Pro Tyr Asp 
10 

Leu Ala Ala Ala 
25 

Leu Asp Phe Val 

Gly Thr Cys Val 

60 

Ala Ala Leu Leu 
75 

Asn Val Glu Glu 

90 

Val Gin Asn His 
105 

Arg Glu Lys Lys 

Pro His Arg He 

140 

Ser Ala Glu Arg 
155 

Gly He Pro Gly 
170 

Ser Leu Pro Tyr 
185 

Val Ala Leu Glu 

Thr Val Met Val 

220 

Ala Asn Lys He 
235 

Arg Gin Phe Val 
250 

Gly Arg Leu Arg 
265 

Glu Gly Glu Tyr 

Thr Arg Lys He 

300 

Thr Gly Lys He 
315 

He Tyr Ala He 
330 

Val Ala He Gin 
345 

Ser Thr Val Lys 

Pro Leu Glu Tyr 

380 

Lys Phe Gly Glu 
395 

Leu Glu Trp Thr 
410 

Val Val Cys Asn 
425 

Leu Gly Pro Asn 

Lys Cys Gly Leu 

460 

Pro Val Cys Ala 
475 

Gly Gly Asn He 
490 



Tyr Asp Leu He 
15 

Lys Glu Ala Ala 
30 

Thr Pro Thr Pro 
45 

Asn Val Gly Cys 

Gly Gin Ala Leu 

80 

Thr Val Lys His 
95 

He Gly Ser Leu 
110 

Val Thr Tyr Glu 
125 

Lys Ala Thr Asn 

Phe Leu He Ala 

160 

Asp Lys Glu Tyr 
175 

Cys Pro Gly Lys 
190 

Cys Ala Gly Phe 
205 

Arg Ser He Leu 

Gly Glu His Met 

24 0 

Pro He Lys Val 
255 

Val He Ala Lys' 
270 

Asn Thr Val Leu 
285 

Gly Leu Glu Asn 

Pro Val Thr Glu 

320 

Gly Asp He Leu 
335 

Ala Gly Arg Leu 
350 

Cys Asp Tyr Glu 
365 

Gly Ser Cys Gly 

Glu Asn Val Glu 

400 

He Pro Ser Arg 
415 

He Lys Asp Asn 
430 

Ala Gly Glu Val 
445 

Thr Lys Asp Gin 

Glu Val Phe Thr 

480 

Leu Gin Thr Gly 

495 



<210> 229 
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<211> 523 
<212> PRT 

<213> Caenorhabditis elegans 



Met°Tvr 2 Ile Lys Gly Asn Ala Val Gly Gly Leu Lys Glu Leu Lys Ala 

1 5 1° 15 

Leu Lys Gin Asp Tyr Leu Lys Glu Trp Leu Arg Asp His Thr Tyr Asp 

20 25 30 

Leu lie Val He Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala Lys Glu 

35 40 45 

Ala Ser Arg Leu Gly Lys Lys Val Ala Cys Leu Asp Phe Val Lys Pro 

50 55 60 

Ser Pro Gin Gly Thr Ser Trp Gly Leu Gly Gly Thr Cys Val Asn Val 
65 70 75 8( ? 

Gly Cys He Pro Lys Lys Leu Met His Gin Ala Ser Leu Leu Gly His 

85 90 9 ^ 

Ser He His Asp Ala Lys Lys Tyr Gly Trp Lys Leu Pro Glu Gly Lys 

100 . 105 HO 

Val Glu His Gin Trp Asn His Leu Arg Asp Ser Val Gin Asp His lie 

115 120 125 

Ala Ser Leu Asn Trp Gly Tyr Arg Val Gin Leu Arg Glu Lys Thr Val 

130 135 140 

Thr Tyr He Asn Ser Tyr Gly Glu Phe Thr Gly Pro Phe Glu He Ser 
145 150 155 ■/ 160 

Ala Thr Asn Lys Lys Lys Lys Val Glu Lys Leu Thr Ala Asp Arg Phe 

165 170 175 

Leu He Ser Thr Gly Leu Arg Pro Lys Tyr Pro Glu He Pro Gly Val 

180 185 190 

Lvs Glu Tyr Thr He Thr Ser Asp Asp Leu Phe Gin Leu Pro Tyr Ser 

* 195 200 205 

Pro Gly Lys Thr Leu Cys Val Gly Ala Ser Tyr Val Ser Leu Glu Cys 

210 215 220 

Ala Gly Phe Leu His Gly Phe Gly Phe Asp Val Thr Val Met Val Arg 
225 230 235 240 

Ser He Leu Leu Arg Gly Phe Asp Gin Asp Met Ala Glu Arg lie Arg 

245 250 255 

Lys His Met He Ala Tyr Gly Met Lys Phe Glu Ala Gly Val Pro Thr 

260 265 270 

Arq He Glu Gin He Asp Glu Lys Thr Asp Glu Lys Ala Gly Lys Tyr 

275 280 285 

Arq Val Phe Trp Pro Lys Lys Asn Glu Glu Thr Gly Glu Met Gin Glu 

290 * 295 300 

Val Ser Glu Glu Tyr Asn Thr He Leu Met Ala He Gly Arg Glu Ala 
305 310 315 320 

Val Thr Asp Asp Val Gly Leu Thr Thr He Gly Val Glu Arg Ala Lys 

325 330 335 

Ser Lys Lys Val Leu Gly Arg Arg Glu Gin Ser Thr Thr He Pro Trp 

340 345 350 

Val Tyr Ala He Gly Asp Val Leu Glu Gly Thr Pro Glu Leu Thr Pro 

355 * * 360 365 

Val Ala He Gin Ala Gly Arg Val Leu Met Arg Arg He Phe Asp Gly 

370 375 380 

Ala Asn Glu Leu Thr Glu Tyr Asp Gin He Pro Thr Thr Val Phe Thr 
385 390 395 400 

Pro Leu Glu Tyr Gly Cys Cys Gly Leu Ser Glu Glu Asp Ala Met Met 

405 410 4X5 

Lys Tyr Gly Lys Asp Asn He He He Tyr His Asn Val Phe Asn Pro 

420 425 ( 430 

Leu Glu Tyr Thr He Ser Glu Arg Met Asp Lys Asp His Cys Tyr Leu 

435 440 445 

Lys Met He Cys Leu Arg Asn Glu Glu Glu Lys Val Val Gly Phe His 

450 455 460 

He Leu Thr Pro Asn Ala Gly Glu Val Thr Gin Gly Phe Gly He Ala 
465 470 475 480 

Leu Lys Leu Ala Ala Lys Lys Ala Asp Phe Asp Arg Leu He Gly He 

~ 490 4 95 



485 
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His Pro Thr Val 

500 

Glu Gly Asp Glu 
515 



Ala Glu Asn Phe 

Glu Leu Gin Ala 

520 



Thr Thr Leu Thr 

505 

Ser Gly Cys 



Leu Glu Lys Lys 
510 



<210> 230 
<211> 497 
<212> PRT 

<213> Homo sapiens 



<400> 230 
Met Asn Gly Pro 
1 

lie lie Gly Gly 

20 

Gin Tyr Gly Lys 
35 

Leu Gly Thr Arg 
50 

lie Pro Lys Lys 
65 

Gin Asp Ser Arg 

Asp Trp Asp Arg 

100 

Asn Trp Gly Tyr 
115 

Asn Ala Tyr Gly 
130 

Asn Lys Gly Lys 
145 

Thr Gly Glu Arg 

Cys He Ser Ser 

180 

Thr Leu Val Val 
195 

Leu Ala Gly lie 
210 

Leu Arg Gly Phe 
225 

Glu Glu His Gly 

Glu Gin He Glu 

260 

Ser Thr Asn Ser 
275 

Leu Ala He Gly 
290 

Val Gly Val Lys 
.3 05 

Glu Glu Gin Thr 

Glu Asp Lys Val 

340 

Leu Ala Gin Arg 
355 

Asn Val Pro Thr 
370 

Leu Ser Glu Glu 
385 

Val Tyr His Ser 

Asp Asn Asn Lys 

420 

Glu Arg Val Val 



Glu Asp Leu Pro 
5 

Gly Ser Gly Gly 

Lys Val Met Val 

40 

Trp Gly Leu Gly 
55 

Leu Met His Gin 
70 

Asn Tyr Gly Trp 
85 

Met He Glu Ala 

Arg Val Ala Leu 

120 

Gin Phe He Gly 
135 

Glu Lys He Tyr 
150 

Pro Arg Tyr Leu 
165 

Asp Asp Leu Phe 

Gly Ala Ser Tyr 

200 

Gly Leu Gly Val 
215 

Asp Gin Asp Met 
230 

He Lys Phe He 
245 

Ala Gly Thr Pro 

Glu Glu He He 

280 

Arg Asp Ala Cys 
295 

He Asn Glu Lys 
310 

Asn Val Pro Tyr 
325 

Glu Leu Thr Pro 

Leu Tyr Ala Gly 

360 

Thr Val Phe Thr 
375 

Lys Ala Val Glu 
390 

Tyr Phe Trp Pro 
405 

Cys Tyr Ala Lys 
Gly Phe His Val 



Lys Ser Tyr Asp 
10 

Leu Ala Ala Ala 
25 

Leu Asp Phe Val 

Gly Thr Cys Val 

60 

Ala Ala Leu Leu 
75 

Lys Val Glu Glu 
90 

Val Gin Asn His 
105 

Arg Glu Lys Lys 

Pro His Arg He 

140 

Ser Ala Glu Ser 
155 

Gly He Pro Gly 
170 

Ser Leu Pro Tyr 
185 

Val Ala Leu Glu 

Thr Val Met Val 

220 

Ala Asn Lys He 
235 

Arg Gin Phe Val 
250 

Gly Arg Leu Arg 
265 

Glu Gly Glu Tyr 

Thr Arg Lys He 

300 

Thr Gly Lys He 
315 

He Tyr Ala He 
330 

Val Ala He Gin 
345 

Ser Thr Val Lys 

Pro Leu Glu Tyr 

380 

Lys Phe Gly Glu 
3 95 

Leu Glu Trp Thr 
410 

He He Cys Asn 
425 

Leu Gly Pro Asn 



Tyr Asp Leu He 
15 

Lys Glu Ala Ala 
30 

Thr Pro Thr Pro 
45 

Asn Val Gly Cys 

Gly Gin Ala Leu 

80 

Thr Val Lys His 
95 

He Gly Ser Leu 
110 

Val Val Tyr Glu 
125 

Lys Ala Thr Asn 

Phe Leu He Ala 

160 

Asp Lys Glu Tyr 
175 

Cys Pro Gly Lys 
190 

Cys Ala Gly Phe 
205 

Arg Ser He Leu 

Gly Glu His Met 

240 

Pro He Lys Val 
255 

Val Val Ala Gin 
270 

Asn Thr Val Met 
285 

Gly Leu Glu Thr 

Pro Val Thr Asp 

320 

Gly Asp He Leu 
335 

Ala Gly Arg Leu 
350 

Cys Asp Tyr Glu 
365 

Gly Ala Cys Gly 

Glu Asn He Glu 

400 

He Pro Ser Arg 
415 

Thr Lys Asp Asn 
430 

Ala Gly Glu Val 
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435 

Thr Gin Gly Phe 
450 

Leu Asp Ser Thr 
465 

Thr Leu Ser Val 
Cys 



440 

Ala Ala Ala Leu 
455 

He Gly lie His 
470 

Thr Lys Arg Ser 
485 



Lys Cys Gly Leu 

460 

Pro Val Cys Ala 
475 

Gly Ala Ser He 
490 



445 

Thr Lys Lys Gin 

Glu Val Phe Thr 

480 

Leu Gin Ala Gly 
495 



<210> 231 
<211> 541 
<212> PRT 

<213> Plasmodium falciparum 

<400> 231 , 

Met Cys Lys Asp Lys Asn Glu Lys Lys Asn Tyr Glu His Val Asn Ala 

1 .5 10 15 

Asn Glu Lys Asn Gly Tyr Leu Ala Ser Glu Lys Asn Glu Leu Thr Lys 

20 25 30 

Asn Lys Val Glu Glu His Thr Tyr Asp Tyr Asp Tyr Val Val He Gly 

35 40 45 

Gly Gly Pro Gly Gly Met Ala Ser Ala Lys Glu Ala Ala Ala His Gly 

50 55 60 

Ala Arg Val Leu Leu Phe Asp Tyr Val Lys Pro Ser Ser Gin Gly Thr 
65 ~ 70 75 80 

Lys Trp Gly He Gly Gly Thr Cys Val Asn Val Gly Cys Val Pro Lys 

85 90 95 

Lys Leu Met His Tyr Ala Gly His Met Gly Ser He Phe Lys Leu Asp 

100 105 HO 

Ser Lys Ala Tyr Gly Trp Lys Phe Asp Asn Leu Lys His Asp Trp Lys 

115 ** ' 120 125 

Lys Leu Val Thr Thr Val Gin Ser His He Arg Ser Leu Asn Phe Ser 

130 135 140 

Tyr Met Thr Gly Leu Arg Ser Ser Lys Val Lys Tyr He Asn Gly Leu 
145 150 155 160 

Ala Lys Leu Lys Asp Lys Asn Thr Val Ser Tyr Tyr Leu Lys Gly Asp 

165 ~ 170 175 

Leu Ser Lys Glu Glu Thr Val Thr Gly Lys Tyr He Leu He Ala Thr 

180 185 190 

Gly Cys Arg Pro His He Pro Asp Asp Val Glu Gly Ala Lys Glu Leu 

195 200 205 

Ser He Thr Ser Asp Asp He Phe Ser Leu Lys Lys Asp Pro Gly Lys 

210 215 220 

Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu Cys Ser Gly Phe 
225 230 235 240 

Leu Asn Ser Leu Gly Tyr Asp Val Thr Val Ala Val Arg Ser He Val 

245 250 255 

Leu Arg Gly Phe Asp Gin Gin Cys Ala Val Lys Val Lys Leu Tyr Met 

260 265 270 

Glu Glu Gin Gly Val Met Phe Lys Asn Gly He Leu Pro Lys Lys Leu 

275 280 285 

Thr Lys Met Asp Asp Lys He Leu Val Glu Phe Ser Asp Lys Thr Ser 

290 295 300 

Glu Leu Tyr Asp Thr Val Leu Tyr Ala He Gly Arg Lys Gly Asp He 
305 310 315 320 

Asp Gly Leu Asn Leu Glu Ser Leu Asn Met Asn Val Asn Lys Ser Asn 

325 330 335 

Asn Lys He He Ala Asp His Leu Ser Cys Thr Asn lie Pro Ser He 

340 345 350 

Phe Ala Val Gly Asp Val Ala Glu Asn Val Pro Glu Leu Ala Pro Val 

355 " " 360 365 

Ala He Lys Ala Gly Glu He Leu Ala Arg Arg Leu Phe Lys Asp Ser 

370 375 380 

Asp Glu He Met Asp Tyr Ser Tyr He Pro Thr Ser He Tyr Thr Pro 
385 390 395 400 
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lie 


Glu 


Tyr 


Gly 


Tyr 


Gly 


Lys 


Ser 








420 


Glu 


He 


Ser 


Ala 






435 




Glu 


Tyr 


Asp 


Leu 




450 






Leu 


Lys 


Asn 


Glu 


465 






Asn 


Ala 


Gly 


Glu 


Val 


Lys 


Lys 


Lys 








500 


Ala 


Glu 


Ser 


Phe 






515 




Tyr 


Ala 


Ala 


Lys 



530 



Ala Cys Gly Tyr 
405 

Asn Val Glu Val 

Val His Arg Gin 

440 

Asp Val Ser Ser 
455 

Asp Asn Arg Val 
470 

Val Thr Gin Gly 
485 

Asp Phe Asp Asn 

Met Asn Leu Phe 

520 

Gly Gly Cys Gly 
535 



Ser Glu Glu Lys 
410 

Phe Leu Gin Glu 
425 

Lys His He Arg 

Thr Cys Leu Ala 

460 

He Gly Phe His 
475 

Met Ala Leu Ala 
490 

Cys He Gly He 
505 

Val Thr He Ser 

Gly Gly Lys Cys 

540 



Ala Tyr Glu Leu 
415 

,Phe Asn Asn Leu 
430 

Ala Gin Lys Asp 
445 

Lys Leu Val Cys 

Tyr Val Gly Pro 

480 

Leu Arg Leu Lys 
495 

His Pro Thr Asp 
510 

Ser Gly Leu Ser 

525 

Gly 



<210> 232 
<211> 535 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 232 
Met Ala Ala Ser 
1 

Pro His Arg Val 

20 

Leu Phe Phe Leu 

35 

Leu Leu Arg Gin 
50 

Arg Val Ser Ala 
65 

Glu He He Glu 

Thr Ala Ala lie 

100 

Glu Gly Tyr Gin 
115 

Thr Glu Val Glu 
130 

Asp Leu Met Glu 
145 

Leu Tyr Pro Glu 

Thr Val Gin Thr 

180 

Ala Thr Gly Ala 
195 

Phe Trp Ser Arg 
210 

Pro Leu Phe Lys 
225 

Ala Thr Glu Glu 

Leu Leu Val Arg 

260 

Arg Val He Asn 
275 

Val Asp Val Leu 
290 

Arg Arg Leu Asp 
305 

Phe Tyr Gly lie 



Pro Lys He Gly 
5 

Ser Ala Ala Ser 

Thr Thr Thr Thr 

40 

Pro Thr Arg Thr 
55 

Thr Ala Asn Ser 
70 

Asn Val Val He 
85 

Tyr Ala Ala Arg 

Met Gly Gly Val 

120 

Asn Phe Pro Gly 
135 

Lys Met Arg Lys 
150 

Asp Val Glu Ser 
165 

Ser Glu Arg Lys 

Thr Ala Arg Arg 

200 

Gly He Ser Ala 
215 

Gly Gin Val Leu 
230 

Ala Leu Tyr Leu 
245 

Arg Asp Gin Leu 

Asn Pro Asn He 

280 

Ser Asn Thr Lys 
295 

Thr Gly Glu Glu 
310 

Gly His Ser Pro 



He Gly He Ala 
10 

Ser Ala Leu Ser 
25 

Thr Thr Arg His 

Arg Ser Ser Asp 

60 

Pro Ser Ser Ser 
75 

He Gly Ser Gly 
90 

Ala Asn Leu Lys 
105 

Pro Gly Gly Gin 

Phe Pro Asp Gly 

140 

Gin Ala Glu Arg 
155 

Leu Ser Val Thr 
170 

Val Lys Cys His 
185 

Leu Arg Leu Pro 

Cys Ala He Cys 

220 

Ala Val Val Gly 
235 

Thr Lys Tyr Ala 
250 

Arg Ala Ser Lys 
265 

Thr Val His Tyr 

Gly Gin Met Ser 

300 

Thr Glu Leu Glu 
315 

Asn Ser Gin Leu 



Ser Val Ser Ser 

15 

Pro Pro Pro His 
30 

Gly Gly Ser Tyr 
45 

Ser Leu Arg Leu 

Ser Ser Gly Gly 

80 

Pro Ala Gly Tyr 
95 

Pro Val Val Phe 
110 

Leu Met Thr Thr 
125 

He Thr Gly Pro 

Trp Gly Ala Glu 

160 

Thr Ala Pro Phe 
175 

Ser He He Tyr 
190 

Arg Glu Glu Glu 
205 

Asp Gly Ala Ser 

Gly Gly Asp Thr 

240 

Arg His Val His 
255 

Ala Met Gin Asp 
270 

Asn Thr Glu Thr 
285 

Gly He Leu Leu 

Ala Lys Gly Leu 

320 

Leu Glu Gly Gin 
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325 










330 




Val 


Glu 


Leu 


Asp 


Ser 


Ser 


Gly Tyr 


Val 


Leu 


Val 








340 










345 






Asn 


Thr 


Ser 


Val 


Glu 


Gly 


Val 


Phe 


Ala Ala Gly 






355 










360 








Glu 


Trp 
370 


Arg 


Gin 


Ala 


Val 


Thr 
375 


Ala 


Ala 


Gly 


Ser 


Leu 


Ser 


Ala 


Glu 


Arg 


Tyr 


Leu 


Thr 


Ser 


Asn 


Asn 


385 








390 










395 


His 


Gin 


Pro 


Gin 


Thr 
405 


Glu 


Glu 


Ala 


Lys 


Lys 
410 


Glu 


Val 


Gin 


Glu 


Lys 
420 


Phe 


Asp 


lie 


Thr 


jjeu 
425 


Thr 


Lys 


Ala 


Leu 


Arg 


Lys 


Leu 


Tyr 


His 


Glu 


Ser 


Pro Arg 






435 










440 






Thr 


Tyr 


T 

Thr 


Ser 


Pro 


Thr 


Cys 


Gly 


Pro 


Cys 


Arg 


450 










455 




His 




Val 


Asn 


Lys 


Val 


Val 


Asp 


Glu 


Tyr Asn 


Asp 


465 








470 










475 


Asp 


lie 


Glu 


Glu. 


Asp 


Gin 


Glu 


lie 


Ala 


Glu 


Ala 








485 










490 




Thr 


Pro 


Cys 


Val 


Gin 


Phe 


Phe 


Lys 


Asn 


Lys 


Glu 






500 










505 






Asn 


Val 


Leu 
515 


Ser 


Val 


Leu 


Lys 


Leu 
520 


His 


Arg 


Leu 


Ala 


Lys 
530 


Asp 


Ser. 


Glu 


Ser 


Val 
535 











335 

Arg Glu Gly Thr Ser 
350 

Asp Val Gin Asp His 
365 

Gly Cys lie Ala Ala 
380 

Leu Leu Val Glu Phe 

400 

Phe Thr Gin Arg Asp 

415 

His Lys Gly Gin Tyr 
430 

Val lie Leu Val Leu 
445 

Leu Lys Pro He Leu 
460 

His Phe Val Glu He 

480 

Ala Gly He Met Gly 

495 

Met Leu Arg Leu Gly 
510 

Leu Cys Ser Gly Leu 
525 



<210> 233 
<211> 117 
<212> PRT 

<213> Helianthus annuus 
<400> 233 

Ala Val Val Glu Ala Tyr Gly Glu Glu Gly Lys Asn Val Leu Gly Gly 

1 5 10 15 

Leu Lys Val Lys Asn Val Val Ser Gly Glu Val Ser Asp Leu Lys Val 

20 25 30 

Asn Gly Leu Phe Phe Ala He Gly His Glu Pro Ala Thr Lys Phe Leu 

35 40 45 

Asp Gly Gin Leu Glu Leu Asp Ser Asp Gly Tyr Val Val Thr Lys Pro 

50 55 60 

Gly Thr Thr He Ser Ser Val Lys Gly Val Phe Ala Ala Gly Asp Val 
65 70 75 80 

Gin Asp Lys Lys Tyr Arg Gin Ala Val Thr Ala Ala Gly Ser Gly Cys 

85 90 95 

Met Ala Ala Leu Asp Ala Glu His Tyr Leu Gin Glu He Gly Ser Gin 

100 105 HO 

Glu Gly Lys Ser Asp 
115 



<210> 234 
<211> 300 
<212> PRT 

<213> Arcaeoglobus fulgidus 
<400> 234 

Met Tyr Asp Val Ala He He Gly Gly Gly Pro Ala Gly Leu Thr Ala 

15 10 15 

Ala Leu Tyr Ser Ala Arg Tyr Gly Leu Lys Thr Val Phe Phe Glu Thr 

20 25 30 

Val Asp Pro Val Ser Gin Leu Ser Leu Ala Ala Lys He Glu Asn Tyr 

35 40 45 

Pro Gly Phe Glu Gly Ser Gly Met Glu Leu Leu Glu Lys Met Lys Glu 
50 55 60 
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Gin 


Ala 


Val 


Lys 


65 








Val 


Glu 


Arg 


Asn 


Tyr 


Glu 


Ala 


Lys 






100 


Ala 


Gly 


He 


Glu 






115 




Cys 


Ala 


Thr 


Cys 




130 






Tyr 


Gly 


Ser 


Gly 


145 








He 

• 


Gly 


Cys 


Glu 


Glu 


Lys 


Ala 


Leu 








180 


Tyr 


Ser 


Thr 


Thr 




195 




Val 


Val 


Ala 


Tyr 




210 






Asp 


Gly 


He 


Phe 


225 








Ala 


Glu 


Leu Gly 


Lys 


Glu 


Gin Arg 






260 


Cys 


Asp 


Asn 


Pro 






275 




Val 


Ala 


Ala 


Tyr 




290 







Ala Gly Ala Glu 
70 

Gly Glu Thr Phe 
85 

Ala He He Val 

Gly Glu Ser Ala 

120 

Asp Gly Asn Phe 
135 

Lys Glu Ala He 
150 

Val Thr He Val 
165 

Val Glu Glu Val 

He Arg Lys He 

200 

Asn Arg Glu Lys 
215 

Val Ala He Gly 
230 

Val Glu Arg Asp 
245 

Thr Asn Val Glu 

Leu Lys Gin Val 

2 80 

Ser Ala Tyr Lys 
295 



Trp Lys Leu Glu 
75 

Thr Val He Ala 
90 

Ala Thr Gly Gly 
105 

Phe He Gly Arg 

Phe Arg Gly Lys 

140 

Glu Asp Ala He 
155 

Ser Arg Thr Pro 
170 

Glu Lys Arg Gly 
185 

He Gly Ser Gly 

Lys Glu Glu Phe 

220 

Met Arg Pro Ala 
235 

Ser Met Gly Tyr 
250 

Gly Val Phe Ala 
265 

Val Thr Ala Cys 

Tyr Leu Thr Ser 

300 



Lys Val Glu Arg 

8 0 

Glu Gly Gly Glu 
95 

Lys His Lys Glu 
110 

Gly Val Ser Tyr 
125 

Lys Val He Val 

Tyr Leu His Asp 

160 

Ser Phe Arg Ala 

175 

He Pro Val His 
190 

Lys Val Glu Lys 
205 

Glu He Glu Ala 

Thr Asp Val Val 

240 

He Lys Val Asp 
255 

Ala Gly Asp Cys 
270 

Gly Asp Gly Ala 
2 85 



<210> 235 
<211> 315 
<212> PRT 
<213> Bacillus 



halodurans 



<400> 235 
Met Gly Glu Glu 
1 

Pro Ala Gly Met 

20 

Thr Val Met Val 
35 

Glu Asp Val Glu 
50 

Leu Ser Thr Lys 
65 

Ala Tyr Gly Asp 

Val Lys Ala Gly 

100 

Thr Gly Ala Glu 
115 

Ser Gly Arg Gly 
130 

Lys Gly Lys Glu 
145 

Glu Ala Val Tyr 

Arg Arg Asp Gin 

18 0 

Asp Asn Asp Lys 
195 

Asn Gly Thr Asp 
210 

Thr Gly Glu Gin 



Gin Lys Val Tyr 
5 

Thr Ala Ala Val 

Glu Arg Gly Val 

40 

Asn Tyr Pro Gly 
55 

Met Phe Glu His 
70 

He Lys Glu lie 
85 

Asn Lys Glu Tyr 

Tyr Lys Lys Leu 

120 

Val Ser Tyr Cys 
135 

Leu Val Val Val 
150 

Leu Thr Arg Phe 
165 

Leu Arg Ala Gin 

He Glu Phe He 

200 

Gly Lys Val Ser 
215 

Gin Asp Phe Lys 



Asp Val Val He 
10 

Tyr Thr Ser Arg 
25 

Pro Gly Gly Gin 

Phe Asp His He 

60 

Ala Lys Lys Phe 
75 

He Asp Gin Gly 
90 

Lys Ala Arg Ala 
105 

Gly Val Pro Gly 

Ala Val Cys Asp 

140 

Gly Gly Gly Asp 
155 

Ala Ser Lys Val 
170 

Lys He Leu Gin 
185 

Trp Asp His Val 

Ser Val Thr lie 

220 

Thr Asp Gly Val 



Ala Gly Ala Gly 
15 

Ala Asn Leu Ser 
30 

Met Ala Asn Thr 
45 

Leu Gly Pro Glu 

Gly Ala Glu Tyr 

80 

Asp Leu Lys Leu 
95 

Val He Val Ala 
110 

Glu Lys Glu Leu 
125 

Gly Ala Phe Phe 

Ser Ala Val Glu 

160 

Thr He He His 
175 

Gin Arg Ala Phe 
190 

Val Lys Gin He 
205 

Glu His Ala Lys 
Phe He Tyr lie 
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225 






230 






Gly Met 


Leu 


Pro 


Leu Asn 


Glu 


Ala 






245 






Asp Glu 


Gly 


Tyr 
260 


He Val 


Thr 


Asn 


Gly He 


Phe 


Ala 


Ala Gly 


Asp 


Val 


275 








280 


Val Thr 


Ala 


Thr 


Gly Asp 


Gly 


Ser 


290 








295 




Tyr He 


Glu 


Glu 


Leu Ala 


Glu 


Lys 


305 






310 







235 240 
Val Lys Asn Leu Asn He Leu Asn 

250 255 
Glu Glu Met Glu Thr Ser Val Pro 
265 270 
Arg Glu Lys Ser Leu Arg Gin He 

285 

Leu Ala Ala Gin Asn Val Gin His 

300 

Val Lys Asn 
315 



<210> 236 
<211> 330 
<212> PRT 

<213> Bacillus halodurans 

<400> 236 , _ _ 

Met Ser Arg Lys Glu Glu Leu Tyr Asp He Thr He He Gly Gly Gly 

15 10 15 

Pro Thr Gly Leu Phe Ala Ala Phe Tyr Gly Gly Met Arg Gin Ala Lys 

20 25 30 

Val Lys He He Glu Ser Met Pro Gin Leu Gly Gly Gin Leu Ala Ala 

35 40 45 

Leu Tyr Pro Glu Lys Tyr He Tyr Asp Val Ala Gly Phe Pro Lys Val 

50 55 60 

Lvs Ala Gin Asp Leu Val Asn Asp Leu Lys Arg Gin Ala Glu Gin Phe 
65 70 75 80 

Asn Pro Thr He Ala Leu Glu Gin Ser Val Gin Asn Val Thr Lys Glu 

85 90 95 

Thr Asp Asp Thr Phe Thr He Lys Thr Asp Lys Glu Thr His Tyr Ser 

100 105 HO 

Lys Ala He He He Thr Ala Gly Ala Gly Ala Phe Gin Pro Arg Arg 

115 120 125 

Leu Glu Val Glu Gly Ala Lys Gin Tyr Glu Gly Lys Asn Leu Gin Tyr 

130 135 140 

Phe Val Asn Asp Leu Asn Ala Tyr Ala Gly Lys Asn Val Leu He Ser 
145 150 155 160 

Gly Gly Gly Asp Ser Ala Val Asp Trp Ala Leu Met Leu Glu Pro Val 

165 170 175 

Ala Lys Asn Val Thr Leu He His Arg Arg Asp Lys Phe Arg Ala His 

180 185 190 

Glu His Ser Val Glu Leu Leu Gin Lys Ser Ser Val Asn He Leu Thr 

195 200 205 

Pro Phe Ala He Ser Glu Leu Ser Gly Asp Gly Glu Lys He His His 

210 215 220 

Val Thr He Gin Glu Val Lys Gly Asp Ala Val Glu Thr Leu Asp Val 
225 230 235 240 

Asp Glu Val He Val Asn Phe Gly Phe Val Ser Ser Leu Gly Pro He 

245 250 255 

Lvs Glv Trp Gly Leu Glu He Glu Lys Asn Ser He Val Val Asn Thr 

260 265 270 

Lys Met Glu Thr Asn He Pro Gly He Tyr Ala Ala Gly Asp He Cys 

275 280 285 

Thr Tyr Pro Gly Lys Val Lys Leu He Ala Thr Gly Phe Gly Glu Ala 

290 295 300 

Pro Thr Ala Val Asn Asn Ala Lys Ala Phe He Asp Pro Thr Ala Arg 
305 310 315 320 

Val Phe Pro Gly His Ser Thr Ser Leu Phe 

325 330 



<210> 237 
<211> 213 
<212> PRT 
<213> Bacillus 



halodurans 
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<400> 237 




























Met 


Thr 


Asn 


JjcU 




-Lyr 


r PVi ~t- 
X I1X 


V cxX 


Lys 


Ser 


JJC IX 


l JG L» 








A nn 


1 








c 










10 










15 




Lys 


Thr 


Val 


Tip 


Tl A 

-J- -L tr 


O o >■» 
OCX 


ril -it 


oxy 


Gly Asn 


o trx 


Ala 


X X c 


Sen 


-LXp 


Ala 








2 0 










25 










30 






Asn 


Glu 


Leu 
35 


VJX Li. 


Prn 

t i. U 


Tie 

X -L C 


Al a 

/4X CL 


40 


Lys 


Val 


x Y *• 




Thr 

J- Hi. 

45 


Tvr 


Arg 




Glu 


Ala 


Leu 


7\ c? n 


Glv 
oxy 


His 


Glu 


Ala 

xl 1 C-* 


Gin 


He 


Ser 


Gin 


Leu 


Leu 


Ser 


Ser 




50 








55 










60 










Ser 


Ala 


Thr 


Pvs 


-i~J ' — V*& 


Phe 

X, -L J- V— 


His 


Thr 


Thr 


He 


Ser 

v »»* iJ— 


Lvs 


Leu 


He 


Ala 


Arcr 


65 










70 










75 










80 


Asp 


Asn 


Lys 


Gl n 


V CI — L 


Tie 

_1- -L. vZ> 


Glu 


Gl n 


Val 


Glu 


J-J 


Thr 


Astd 


His 


Gin 


Thr 






85 










90 










95 




Gly 


Glu 


Val 


Thr 

X 1 J. JL 


A cj n 


T 11=11 


Al a 


Val 

V CL J. 


Asp 


Glu 


Val 

V CI JL 


He 

«l— 


He 


Asn 


His 


Glv 






100 










105 










110 






Tyr 


Glu 


Arg 


t\c> LJ 




Opr 


JJCU 


Ut-» LI 


Asp 


Gin 




Glu 


Val 


Thr 

-J- X X x«- 


Leu 


Asp 






115 

■1* -J-. w 










12 0 










125 




Thr 

X. X 1 .1 ■ 




Arg 


lie 


Asp 




iyr 


xyr 


Tl o 
lie 


Ala 


Gly Thr 


f i u 


Til T* 




Ala 


Ser 




130 










135 










14 0 










Val 


Gly 


Gly 


He 


Tyr 


Ala 


Ala 


Gly 


Asp 


Val 


Leu 


Lys 


His 


Glu 


Gly 


Lys 


145 










150 










155 










160 


Leu 


His 


Leu 


He 


Ala 
165 


Gly 


Ala 


Phe 


Gin 


Asp 
170 


Ala 


Ala 


Asn 


Ala 


Val 
175 


Asn 


Gin 


Ala 


Lys 


Gin 


Trp 


He 


Glu 


Pro 


Glu 


Ala 


His 


Gin 


Ser 


Ala 


Met 


Val 






180 








185 










190 






Ser 


Ser 


His 
195 


Asn 


His 


Val 


Phe 


Lys 
200 


Glu 


Arg 


Asn 


Arg 


Glu 
205 


Leu 


He 


Arg 


Gin 


Met 
210 


Leu 


Lys 


Asn 

























<210> 238 
<211> 136 
<212> PRT 

<213> Bacillus halodurans 
<400> 238 



Met 


Asn 


Trp 


Glu 


Glu 


Leu 


Tyr 


Asp 


Val Thr He He 


Gly 


Gly 


Gly 


Pro 


1 






5 








10 






15 




Ala 


Gly 


Leu 


Phe 


Ser 


Ala 


Phe 


Tyr 


Ser Gly Leu Arg 


Glu 


Met 


Lys 


Thr 






20 








25 




30 






Lys 


Val 


He 


Glu 


Tyr 


Gin 


Pro 


Met 


Leu Gly Gly Lys 


Val 


His 


Val 


Tyr 






35 










40 




45 








Pro 


Glu 


Lys 


Met 


He 


Trp 


Asp 


Val 


Gly Gly Leu Thr 


Pro 


He 


Leu 


Gly 




50 








55 




60 










Glu 


Lys 


Leu 


He 


Glu 


Gin 


Leu 


Val 


Thr Gin Ala Leu 


Thr 


Phe 


Asn 


Pro 


65 








70 






75 








80 


Thr 


Val 


Val 


Leu 


Asn 


Glu 


Lys 


Val 


Thr Ser He Ala 


Gin 


Glu 


Glu 


Ser 










85 








90 






95 




Gly 


Trp 


Phe 


Val 


He 


Arg 


Thr 


Ala 


Ser Gly Arg Ala 


His 


Leu 


Thr 


Lys 








100 










105 




110 






Thr 


Val 


He 


He 


Ala 


Val 


Gly 


Gly 


Gly He Leu Lys 


Pro 


Gin 


Lys 


Asn 






115 










120 




125 








Arg 


Ala 


Arg Arg 


Gly Arg 


Thr 


He 














130 










135 















<210> 239 
<211> 312 
<212> PRT 

<213:> Campylobacter jejuni 
<400> 239 

Met Leu Asp Val Ala He He Gly 

1 5 
Gly Leu Tyr Ala Thr Arg Gly Gly 

20 



Gly Gly Pro Ala Gly Leu Ser Ala 

10 15 
Leu Lys Asn Val Val Met Phe Glu 
25 30 
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Lys 


wiy 




riU 


y 


Gly 


Gin 


Tie 


X AXX. 


Ser 


Ser 


Ser 


Glu 


He 


Glu 


Asn 


3 5 










40 










45 








J.yr 


rxO 


vj j.y 


v a. _L 


Al a 


Gin 


Val 


Met 


Asn 


Gly 


He 


Ser 


Phe 


Met 


Ala 


Pro 


50 








55 










60 










J. if 


l_> C I. 


Glu 


Gin 


Cvs 


Met 


Arg 


Phe 


Gly 


Leu 


Lys 


His 


Glu 


Met 


Yal 


Gly 

■ml 


65 










70 










75 










80 


V d JL 


V3-L \X 


Gin 


He 

X ^ 


Leu 


Lys 


Asn 


Ser 


Asp 


Gly Ser 


Phe 


Thr 


He 


Lys 


Leu 










85 








90 










95 




VjXU 






U Y S 


Thr 


Glu 


Leu 


Ala 


Lys 


Ala 


Val 


He 


Val 


Cys 


Thr 


Gly 




100 










105 










110 






•J — X 


AT a 

-T"\ _1_ CL 


XT -L. 1 


T ,VR 


xjy t» 


Ala Gly 


Phe 


Lys 


Gly Glu Asp 


Glu 


Phe 


Phe 


Gly 

-4 






115 










120 










125 








ijy S 


\aJ- y 


V CL-L 


Ser 


Thr 


Cys 


Ala 


Thr 


Cys 


Asp 


Gly 


Phe 


Phe 


Tyr 


Lys 


Asn 


130 










135 










14 0 










Jbys 


l>JT J. u 


Ua 1 
V d J_ 


Ala 


Val 


Leu 


Gly Gly 


Gly Asp 


Thr 


Ala 


Leu 


Glu 


Glu 


Ala 


J. ft z> 










150 










155 






His 




160 


j_ieu 


iyr 


JjcU 


Ala 


Asn 


He 


Cys 


Ser 


Lys 


He 


Tyr 


Leu 


He 

•mmm, kli ^u> 


Ara 


Arq 








165 










170 










175 






m ii 

\J J- Li 


Phe 


Arg 


Ala 


Ala 


Pro 


Ser 


Thr 


Val 


Glu 


Lys 


Val 


Lys 


Lys 


Asn 






180 










185 










190 






vjXU 


uy o 


He 


Glu 


Leu 


He 


Thr 


Ser 


Ala 


Ser 


Val 


Asp 


Glu 


Val 


Tyr 


Gly 

ml 




195 










200 










205 










TjVS 


Met 


Gly Val 


Ala -Gly Val 


Lys 


Val 


Lys 


Leu 


Lys 


Asp 

Am 


Gly 


Ser 


210 










215 










220 










Tie 




Asp 


Leu 


Asn 


Val 


Pro 


Gly 


He 


Phe 


Thr 


Phe 


Val 


Gly 


Leu 


Asn 


225 






230 










235 










240 


Val 


Arg 


Asn 


Glu 


He 


Leu 


Lys 


Gin 


.Asp 


Asp 


Ser 


Lys 


Phe 


Leu 


Cys 


Asn 








245 










250 










255 




Met 


Glu 


Glu Gly Gly 


Gin 


Val 


Ser 


Val 


Asp 


Leu 


Lys 


Met 


Gin 


Thr 


Ser 








'260 










265 










270 






Val 


Ala 


Gly 


Leu 


Phe 


Ala 


Ala 


Gly Asp 


Leu 


Arg 


Lys 


Asp 


Ala 


Pro 


Lys 






275 










280 










285 








Gin 


Val 


He 


Cys 


Ala 


Ala 


Gly Asp 


Gly 


Ala 


Val 


Ala 


Ala 


Leu 


Ser 


Ala 




290 








295 










300 










Met 


Ala 


Tyr 


lie 


Glu 


Ser 


Leu 


His 


















305 








310 























<210> 240 
<211> 348 
<212>- PRT 

<213> Caulobacter crescentus 



<400> 240 



Met 


Ser 


Pro 


Leu 


Arg 


Arg 


He 


His 


Thr 


He 


Ser 


Pro 


Pro 


Met 


Ser 


Thr 


1 








5 








10 










15 




Leu 


Ser 


Pro 


Arg 


Gin 


Thr 


Arg 


Cys 


Leu 


He 


He 


Gly 


Ser Gly Pro 


Ala 








20 










25 










30 






Gly 


Tyr 


Thr 


Ala 


Ala 


He 


Tyr 


Ala 


Ala 


Arg 


Ala 


Leu 


Leu 


Lys 


Pro 


Val 


35 










40 










45 








Leu 


He 


Ala Gly 


He 


Gin 


Pro 


Gly 


Gly 


Gin 


Leu 


Thr 


He 


Thr 


Thr 


Asp 




50 










55 










60 










Val 


Glu 


Asn 


Tyr 


Pro 


Gly 


Phe 


Ala 


Asp 


Val 


He 


Gin 


Gly 


Pro 


Trp 


Leu 


65 








70 










75 










80 


Met 


Asp 


Gin 


Met 


Arg 


Ala 


Gin 


Ala 


Glu 


His 


Val 


Gly 


Thr 


Glu 


Phe 


Val 








85 










90 










95 




Ser 


Asp 


He 


Val 


Thr 


Ser 


Val 


Asp 


Leu 


Ser 


Lys 


Arg 


Pro 


Phe 


Thr 


Val 






100 








105 










110 






Lys 


Thr 


Asp 


Ser 


Gly 


Gin 


Asp 


Trp 


He 


Ala 


Glu 


Thr 


He 


He 


He 


Ala 




115 










120 










125 








Thr 


Gly 


Ala 


Gin 


Ala 


Lys 


Trp 


Leu 


Gly 


Leu 


Glu 


Ser 


Glu 


Ala 


Lys 


Phe 




130 








135 










140 










Gin 


Gly 


Phe 


Gly 


Val 


Ser 


Ala 


Cys 


Ala 


Thr 


Cys 


Asp 


Gly 


Phe 


Phe 


Tyr 


145 






150 










155 










160 


Arg 


Asn 


Lys 


Asp 


Val 


He 


Val 


Val 


Gly 


Gly 


Gly 


Asn 


Thr 


Ala 


Val 


Glu 




165 










170 










175 




Glu 


Ala 


Leu 


Phe 


Leu 


Thr 


Ser 


Phe 


Ala 


Ser 


Lys 


Val 


Thr 


Leu 


Val 


His 
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180 




u y o 




Glu 

\J -X. V--*- 






195 




n"l -3 




-L -L w 






210 






T 

J_iC? Li 






J. 1J..L. 


225 








Val 


Lys 


Thr 


Gly 


tv n 


lie 




IX -1- O 








260 


Thr 


Gly 


Ser 


Gly 






275 




Ala 


He 


Glu 


Gly 




290 






Arg 


Gin 


Ala 


Val 


305 








Ala 


Val 


Arg 


Phe 


lie 


Ser 


His 


Ala 








340 



Leu Arg Ala Glu 

200 

He Glu Val He 
215 

Asp Pro Met Gly 
230 

Glu. Thr Gin Glu 
245 

Ala Pro Ser Ser 

Gly Tyr Leu Lys 

280 

Val Tyr Ala Ala 
295 

Thr Ala Ala Gly 
310 

Leu Ala Glu Glu 
325 

Glu Ala Asn Lys 



185 



Lys 


He 


Leu 


Gin 


Trp 


Asp 


Ser 


Val 








220 


Val 


Thr 


Gly 


Ala 






235 




Val 


Ala 


Ala 


Asp 




250 






Glu 


Leu 


Phe 


Ala 


265 








Val 


Lys 


Pro 


Gly 


Gly 


Asp 


Val 


Thr 








300 


Met 


Gly 


Cys 


Met 






315 




Asp 


His 


Lys 


Ala 




330 






He 


Gly 


Val 


Trp 



345 



190 



Glu 


Arg 


Leu 


Leu 


205 








He 


Asp 


Glu 


Val 


Arg 


Leu 


Lys 


Asn 








240 


Gly 


Val 


Phe 


He 






255 




Gly 


Gin 


Leu 


Glu 




270 






Thr 


Ala 


Ser 


Thr 


285 








Asp 


Asp 


Val 


Tyr 


Ala 


Ala 


Leu 


Glu 








320 


Ala 


His 


His 


Pro 






335 





<210> 241 
<211> 285 
<212> PRT 

<213> Clostridium acetobutylicum 



<400> 241 



Met 


Glu 


Arg 


Tyr 


1 








Ala 


Ser 


Ala 


He 








20 


Gly 


Ser 


Ser 


Asp 






35 




Asn 


Tyr 


Leu 


Gly 




50 






Phe 


Lys 


Glu 


His 


65 








Val 


Asn 


Asn 


He 


Lys 


Asp 


Thr 


Tyr 








100 


His 


Thr 


Lys 


Pro 






115 




Gly 


Tyr 


Cys 


Ala 




13 0 






Thr 


He 


Val 


Gly 


145 








Ala 


Glu 


Leu 


Ala 


Tyr 


Gin 


Leu 


Val 








180 


He 


Val 


Gly 


Asp 






195 




Leu 


Glu 


Thr 


Asp 




210 






Gin 


Leu 


Val 


Pro 


225 








Arg 


Lys 


Met 


Glu 


Thr 


Gly 


Lys 


Pro 








260 


Ala 


Ala 


Leu 


Asn 






275 





Asp 
5 


He 


Ala 


He 


Asn 


Ala 


Lys 


Thr 


Leu 


Ser 


Lys 


Lys 








40 


Phe 


Tyr 


Gly 


He 






55 




He 


Asp 


Asn 


Met 




70 






Tyr 


Ala 


Met 


Gly 


85 








Glu 


Ala 


Ser 


Lys 


Leu 


Lys 


Gly 


Glu 








120 


Thr 


Cys 


Asp 


Ala 






135 




Tyr 


Asn 


Lys 


Glu 




150 






Ser 


Lys 


Val 


Tyr 


165 








Ser 


Ala 


Val 


Glu 


Lys 


Lys 


Val 


Glu 








200 


Gly 


Val 


Phe 


Val 






215 




Gly 


Leu 


Tyr 


Val 




230 






Thr 


Asn 


He 


Asp 


245 








Tyr 


Gin 


Tyr 


Met 


Ala 


Val 


Glu 


Lys 








280 



He 


Gly 


Ser 


Gly 




10 






Arg 


Asn 


Lys 


Ser 


25 








Leu 


Thr 


Leu 


Ala 


Arg 


Gly 


Ala 


Glu 








60 


Gly 


He 


Gin 


He 






75 




Glu 


Tyr 


Phe 


Ser 




90 






Val 


He 


Leu 


Ala 


105 








Asp 


Lys 


Phe 


Leu 


Pro 


Leu 


Tyr 


Lys 








140 


Ala 


Glu 


Ser 


GlU 






155 




Tyr 


Val 


Pro 


Arg 




170 






He 


Val 


Lys 


Asp 


185 








Lys 


Leu 


Lys 


Leu 


Leu 


Lys 


Asp 


Ser 








220 


Glu 


Asp 


Gly 


His 






235 




Gly 


Cys 


Tyr 


Ala 




250 






Lys 


Ala 


Val 


Gly 


265 








Leu 


Tyr 


Thr 


Lys 



Pro 


Ala 


Gly 


Leu 






15 




Val 


He 


Val 


Phe 




30 






Pro 


Val 


He 


Asn 


45 








Leu 


Gin 


Glu 


Lys 


Glu 


Asn 


Val 


LyB 








80 


He 


Met 


Thr 


Ser 






95 




Met 


Gly 


Met 


Glu 




110 






Gly 


Arg 


Gly 


Val 


125 








Gly 


Lys 


He 


Val 


Ala 


Asn 


Tyr 


Leu 








160 


Tyr 


Lys 


Asp 


Glu 






175 




Val 


Pro 


Val 


Glu 




190 






Lys 


Ser 


Arg 


Glu 


205 








Ala 


Pro 


Pro 


Glu 


He 


Lys 


Val 


Asn 








240 


Ala 


Gly 


Asp 


Cys 






255 




Glu 


Gly 


Gin 


Val 




270 







Ala 
285 
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<210> 242 
<211> 291 
<212> PRT 

<213> Clostridium acetobutylicum 



<400> 242 



Met 


Asp 


Arg 


Tyr 


Asp 

XT 


He 


Ala 


He 


He 


Gly 


Ser 


Gly 


Pro 


Ala 


Gly 


Leu 


1 








5 










10 










15 




Ser 


Ala 


Ala 


He 


Asn 


Ala 


Val 


He 


Arg Asn 


Lys 


Lys 


Val 


lie 


Leu 


Phe 








20 










25 






30 






Gly 


Ser 


Asp 
35 


Asn 


Leu 


Ser 


Asn 


Lys 
40 


Leu 


Leu 


Lys 


Ala 


Pro 
45 


Lys 


He 


Asn 


Asn 


Tyr 


Leu 


Gly 


He 


Tvr 

J; 


Asp 


Val 


Ser 


Gly Lys 


Glu 


Leu 


Lys 


Glu 


Lys 




50 










55 










60 










Phe 


Leu 


Glu 


Hi s 


Leu 


Lys 


Tyr 


Met 


Asn 


He 


Glu 


He 


Lys 


Asn 


Glu 


Lys 


65 










70 










75 








80 


Val 


Asn 


Ser 


Val 


Tvr 
85 


Ser 


Met 


Gly 


Asp 


Tyr 


Phe 


Ala 


Leu 


Ser 


Leu 
95 


Asn 


Gin 


Lys 


Met 


Tyr 
100 


Glu 


Ala 


Thr 


Ser 


He 
105 


He 


He 


Ala 


Ser 


Gly 
110 


Val 


Glu 


Phe 


Ser 


Lys 


Pro 


Leu 


Asn 


Gly 


Glu 


Asp 


Glu 


Leu 


Leu 


Gly 


Lys 


Gly Val 






115 










120 










125 








Gly Tyr Cys 


Ala 


Thr 


Cys 


Asp 


Ala 


Pro 


Leu 


Tyr 


Lys 


Gly 


Lys 


Thr 


Val 




130 










135 










140 










Ala 


He 


Val 


Glv 


Tvr 


Thr 


Lvs 


Glu 

V^- 


Ala 


Glu 


Glu 


Glu 


Ala 


Asn 


Tyr 


Val 


14 5 










150 










155 










160 


Ser 


Glu 


Leu 


Ala 


Gly 
165 


Lys 


Leu 


Tyr 


Tyr 


He 
170 


Pro 


Met 


Tyr 


Lys 


Asp 
175 


Lys 


Val 


Ser 


Leu 


Lys 
1B0 


Glu 


Val 


He 


Glu 


Val 
185 


Val 


Glu 


Asp 


Lys 


Pro 
190 


He 


Ser 


He 


Leu 


Gly 

195 


Lys 


Asp 


Lys 


Val 


Ser 
200 


Gly 


Leu 


Gin 


Met 


Ser 
205 


Lys 


Gly 


Glu 


He 


Asn 
210 


Thr 


Asp 


Ala 


Val 


Phe 
215 


He 


He 


Lys 


Asp 


Ser 
220 


Val 


Ser 


Pro 


Gly 


Lys 


Leu 


Val 


Pro 


Gly 


Leu 


Leu 


Met 


Asn 


Gly 


Glu 


His 


He 


Ala 


Val 


Asp 


225 










230 










235 










240 


He 


Asp 


Met 


Lys 


Thr 


Asn 


He 


Glu 


Gly 


Cys 


Phe 


Ala 


Ala 


Gly Asp 


Cys 










245 










250 










255 




Ala 


Gly Arg 


Pro 


Tyr 


Gin 


Tyr 


He 


Lys 


Ser 


Ala 


Gly 


Gin 


Gly 


Gin 


He 








260 










265 










270 






Ala 


Ala 


Leu 
275 


Ser 


Ala 


Val 


Ser 


Tyr 
280 


He 


Asp 


Lys 


He 


Lys 
285 


Leu 


Asn 


Lys 



Lys He He 
290 



<210> 243 
<211> 314 
<212> PRT 

<213> Clostridium sticklandii 



<400> 243 



Met 


Ser 


Lys 


He 


Tyr 


Asp 


Leu 


Val 


lie 


He Gly Ala 


Gly 


Pro 


Ala 


Gly 


1 








5 










10 






15 




Leu 


Ser 


Ala 


Gly 


Leu 


Tyr 


Gly 


Ala 


Arg Gly Lys Met 


Ser 


Thr 


Leu 


He 








20 










25 






30 






He 


Glu 


Lys 


Asp 


Lys 


Thr 


Gly 


Gly 


Gin 


He Val Thr 


Thr 


Glu 


Glu 


Val 






35 










40 






45 








Ala 


Asn 


Tyr 


Pro 


Gly 


Ser 


He 


His 


Asp 


Ala Ser Gly 


Pro 


Ser 


Leu 


He 




50 










55 






60 










Ala 


Arg 


Met 


Ala 


Glu 


Gin 


Ala 


Asp 


Glu 


Phe Gly Thr 


Glu 


Arg 


He 


Lys 


65 










70 








75 








80 


Asp 


Ser 


He 


Val 


Asp 


Phe 


Asp 


Phe 


Thr 


Gly Lys He 


Lys 


He 


Leu 


Lys 










85 










90 






95 




Gly 


Thr 


Lys 


Ala 


Glu 


Tyr 


Gin 


Ala 


Lys 


Ala Val He 


Val 


Ala 


Thr 


Gly 








100 










105 






110 






Ala 


Ser 


Pro 


Lys 


Lys 


Leu 


Asp 


Cys 


Pro 


Gly Glu Lys 


Glu 


Leu 


Thr 


Gly 



-143- 



WO 02/50289 



PCT/US0 1/50240 







TIC 

115 




Lys 


Gly 


Val 


Ser 




13 0 






Met 


Glu 


Val 


Phe 


145 








Met 


Tyr 


Leu 


Thr 


Asp 


Ser 


Leu 


Arg 








18 0 


Pro 


Lys 


He 


Asp 






1 or 

195 




Asp 


Gly 


He 


Val 




210 






Leu 


Ser 


Glu 


His 


225 








Val 


Phe 


Thr 


Gly 


Asp 


Met 


Asn 


Gin 








260 


Asn 


He 


Pro 


Gly 






275 




Arg 


Gin 


Val 


Val 




290 






Ala 


Glu 


Lys 


Tyr 


305 















120 


Tyr 


cys 


Ala 


Thr 






135 




Val 


Val 


Gly Gly 




J. 3 u 






Lys 




Ala 


C? Q 'V 


165 








Ala 


Ala 


Lys 




Pne 


Lys 


Trp 


Asp 








200 


Glu 


Ser 


Val 


Val 






215 




Phe 


Ala 


Asp 


Glu 




230 






Tyr 


He 


Pro 


Gin 


245 








Ser 


Gly Tyr 


Phe 


Val 


Phe 


Ala 


Ala 








280 


Thr 


Ala 


Thr 


Ala 






295 




He 


Glu 


His 


Glu 




310 







Cys Asp Ala Asp 

14 0 

Gly Asp Ser Ala 
155 

Lys Val Thr He 
170 

He Gin Asp Lys 
185 

Ser Val He Lys 

Phe Glu Asn Thr 

220 

Glu Phe Gly Thr 
235 

Thr Asp He Phe 
250 

Val Thr Asn Gin 
265 

Gly Asp Cys Arg 

Asp Gly Ala He 

300 

Gly Leu 



125 

Phe Phe Gin Asp 

Val Glu Glu Ala 

160 

Val His Arg Arg 
175 

Ala Phe Ala Asn 
190 

Glu He Lys Gly 
205 

Lys Thr Gly Glu 

Phe Gly He Phe 

240 

Lys Asp Lys Val 
255 

Asn Met Glu Thr 
270 

Glu Lys Val Leu 
2 85 

Ala Ala He Met 



<210> 244 
<211> 325 
<212> PRT 

<213> Deinococcus radiodurane 



<400> 244 
Met Thr Ala 


Pro 


Thr 


Ala 


His 


Asp Tyr 


Asp 


Val 


Val 


He 


He 


Gly 


Gly 


1 








5 






10 










15 




Gly 


Pro 


Ala 


Gly 


Leu 


Thr 


Ala 


Ala He 


Tyr 


Thr Gly Arg 


Ala 


Gin 


Leu 






20 








25 










30 


Ala 




Ser 


Thr 


Leu 


He 


Leu 


Glu 


Lys 


Gly Met 


Pro Gly Gly Gin 


He 


Trp 






35 








40 








45 






Gly 


Ser 


Glu 


Glu 


Val 


Glu 


Asn 


Phe 


Pro Gly 


Phe 


Pro 


Glu 


Pro 


He 


Ala 




50 










55 








60 










Met 


Glu 


Leu 


Ala 


Gin 


Arg 


Met 


His Gin 


Gin 


Ala 


Glu 


Lys 


Phe 


Gly Ala 


65 










70 








75 






Ala 


Thr 


80 


Lys 


Val 


Glu 


Met 


Asp 


Glu 


Val 


Gin Gly Val 


Gin 


His 


Asp 


Ser 








85 








90 










95 


Ala 


His 


Pro 


Tyr 


Pro 


Phe 


Thr 


Val 


Arg Gly Tyr Asn Gly 


Glu 


Tyr 


Arg 






100 








105 










110 




He 


Lys 


Ala 


Val 


He 


Leu 


Ala 


Thr 


Gly Ala 


Asp 


Pro 


Arg 


Lys 


Leu Gly 




115 










120 








125 




Ala 


Thr 


Pro 


Gly Glu 


Asp 


Asn 


Phe 


Trp 


Gly Lys 


Gly Val 


Ser 


Thr 


Cys 




130 








135 








14 0 








Gly 


Cys 


Asp 


Gly 


Phe 


Phe 


Tyr 


Lys 


Gly Lys 


Lys 


Val 


Val 


Val 


He 


Gly 


145 






15 0 








155 






Phe 


Ala 


160 


Glv Asp 


Ala 


Ala 


Val 


Glu 


Glu 


Gly Met 


Phe 


Leu 


Thr 


Lys 


Asp 










165 








170 










175 


Val 


Glu 


Val 


Thr 


Val 


He 


His 


Arg 


Arg Asp 


Thr 


Leu 


Arg 


Ala 


Asn 


Lys 








180 






185 










190 






Ala 


Gin 


Ala 


Arg 


Ala 


Phe 


Ala 


Asn Pro 


Lys 


Met 


Lys 


Phe 


He 


Trp 


Asp 






195 








200 








205 








Thr 


Ala 


Val 


Glu 


Glu 


He 


Gin 


Gly Ala 


Asp 


Ser 


Val 


Ser 


Gly Val 


Lys 




210 










215 








220 








Gly 


Leu 


Arg 


Asn 


Leu 


Lys 


Thr Gly 


Glu Val 


Ser 


Glu 


Leu 


Ala 


Thr Asp 


225 






230 








235 










240 


Val 


Phe 


He 


Phe 


He 


Gly His 


Val Pro 


Asn 


Thr 


Ala 


Phe 


Val 


Lys 


Asp 










245 








250 










255 


He 


Thr 


Val 


Ser 


Leu 


Arg 


Asp 


Asp 


Gly Tyr 


Val 


Asp 


Val 


Arg 


Asp 


Glu 








260 






265 










270 
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Tyr Thr Asn lie 

275 

lie Tyr Arg Gin 
290 

Met Met Thr Glu 
305 

Val Thr Ala Ala 



Pro Met Leu Phe 

280 

Leu Ala Thr Ser 
295 

Arg Gin Leu Ala 
310 

Asp 
325 



Ala Ala Gly Asp 

Val Gly Ala Gly 

300 

Ala Leu Glu Val 
315 



Val Ser Asp Tyr 
285 

Thr Arg Ala Ala 

Glu Gly Glu Glu 

320 



<210> 245 

<211> 61 

<212> PRT 

<213> Enterococcus faecalis 
<220> 

<221> VARIANT 

<222> 33, 45, 46 

<223> Xaa = Any Amino Acid 



Met Met Asp Thr 
1 

Thr Thr Thr Ser 

20 

Xaa Pro Glu Leu 
35 

Val Ala Phe Thr 
50 



Leu He He Glu 
5 

Glu He Val Asn 

Met Gly Glu Met 

40 

Lys Asp Glu He 
55 



Lys Asp Lys He 
10 

Tyr Pro Ala He 
25 

Arg He Gin Ala 

He Asp Val Asp 

60 



Gly Gly Gin Val 
15 

Arg His Thr Thr 
30 

Xaa Xaa Phe Gly 

45 

Phe 



<210> 246 
<211> 205 
<212> PRT 

<213> Halobacterium sp 



<400> 246 




Met 


Thr 


Glu Asp 


1 








Gly 


Leu 


Ser 


Ala 








20 


Val 


Leu 


Glu 


Gly 






35 




Val 


Glu 


Asn 


Tyr 




50 






Val 


Gin 


Arg 


Gly 


65 








His 


Gly 


Arg 


He 


Ser 


Leu 


Ser 


Thr 








100 


Thr 


Gly 


Ala 


Ser 






115 




Met 


Gly 


Ala Gly 




130 






Arg 


Gly 


Asp 


Asp 


145 








Glu 


Ala 


Leu 


Phe 


Arg 


Arg 


Glu 


Glu 








180 


Asp 


His 


Asp 


Asp 






195 





Ser His Asp Leu 
5 

Ala Val Tyr Ala 

Asp Glu Pro Gly 

40 

Leu Gly Phe Pro 
55 

Lys Glu Gin Ala 
70 

Glu Ala Ala Asp 
85 

Gly Asp Thr Leu 

Ala Arg Trp Val 

120 

Leu Ser Thr Cys 
135 

Val Leu Val Val 
150 

Leu Ala Lys Phe 
165 

Leu Arg Ala Ser 

Val Gin Phe Arg 

200 



Val He Ala Gly 
10 

Ala Arg Ala Asp 
25 

Gly Gin Leu Thr 

Asp Gly Val Gly 

60 

Glu Gin Phe Gly 
75 

Leu Asp Gly Gin 
90 

Tyr Thr Arg Ser 
105 

Gly Ala Glu Asn 

Ala Thr Cys Asp 

140 

Gly Gly Gly Asp 
155 

Ala Asp Ser Val 
170 

Glu He Met Ala 
185 

Trp Asn Thr Glu 



Ser Gly He Ala 
15 

Leu Glu Pro Leu 
30 

Leu Thr Thr Asp 
45 

Gly Met Asp Leu 

Ala Gin Phe Glu 

80 

Pro Leu Glu Leu 
95 

Leu He Val Ala 
110 

Glu Asp' Glu Leu 
125 

Gly Ala Phe His 

Ser Ala Met Glu 

160 

Thr Val Val His 
175 

Asp Arg Ala Arg 
190 

Leu 
205 



<210> 247 
<211> 362 
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<212> PRT 

<213> Halobacterium sp 



<400> 247 



Met 


Thr 


Glu 


Ala 


Thr 


Ala 


Asp 


Arg 


Thr 


Ala 


Leu 


Thr 


Asp 


Gly 


Gly- Arg 


1 








5 










10 










15 




Asp 


Val 


Val 


Glu 
20 


His 


Arg 


Gin 


Leu 


Val 
25 


He 


Val 


Gly 


Ser 


Gly 
30 


He 


Ala 


Ala 


Leu 


Ser 
35 


Ala 


Ala 


Thr 


Tyr 

-i 


Ala 
40 


Ala 


Arg 


Ser 


Asn 


Asn 
45 


Asp 


Pro 


Leu 


Leu 


Phe 


Glu 


Gly 


Asp 


Glu 


Pro 


Gly Gly 


Gin 


Leu 


Thr 


Leu 


Thr 


Ser 


Glu 




50 










55 










60 










Val 


Glu 


Asn 


Tyr 


Pro 


Gly 


Phe 


Pro 


Glu 


Gly 


He 


Ala 


Gly 


Ala 


Glu 


Leu 


65 










70 








75 








80 


lie 


Gin 


Glu 


Met 


Lys 


Thr 


Gin 


Ala 


Thr 


Arg 


Phe Gly Ala 


Glu 


Val 


GlU 










85 










90 










95 




His 


Gly 


lie 


Val 


Glu 


Ser 


Val 


Asp Asp 


Ser Gly Arg 


Pro 


Phe 


Arq 


Leu 








100 










105 










110 






Thr 


Leu 


Thr 


Asn 


Gly Asp 


Val 


Tyr 


Thr 


Ala 


Asp 


Ala 


Val 


He 


Val 


Ala 






115 










120 










125 








Ser 


Gly 
130 


Ala 


Ser 


Ala 


Arg 


Thr 
135 


Leu 


Gly 


He 


Pro 


Gly 
140 


Glu 


Asd 


Glu 


Leu 


Met 


Gly 


Gin 


Gly 


Val 


Ser 


Thr 


Cys 


Ala 


Thr 


Cys 


Asp 


Gly 


Ala 


Phe 


Phe 


145 










150 










155 










160 

_j_ \j \j 


Arg 


Gly 


Glu 


Asp 


Met 


lie 


Val 


Val 


Gly 


Gly 


Gly Asp 


Ala 


Ala 


Ala 


Glu 










165 










170 










175 




Glu 


Ala 


Ser 


Phe 
180 


Leu 


Thr 


Lys 


Phe 


Ala 
185 


Asp 


Thr 


Val 


Tyr 


Leu 
190 


Val 


His 


Arg Arg 


Asp 


Glu 


Leu Arg 


Ala Glu Asp 


Tyr 


Trp 


Ala 


Asp 


Arcr 


He 


Arg 






195 










200 










205 








Glu 


His 
210 


Val 


Ala 


Asp 


Gly 


Asp 
215 


He 


Glu 


Val 


Leu 


Trp 

220 


Asn 


Thr 


Glu 


Ala 


Val 


Glu 


Val 


His 


Gly 


Ser 


Pro 


Glu 


Glu 


Gly 


Val 


Thr 


Gly 


Ala 


Ser 


Leu 


225 










230 










235 








OA 0 

A *± V 


Val 


Arg 


His 




Glu Gly 


His 


Pro 


Thr 


Ala 


Lys 


Leu 


Asp 




Asp 


Glu 










245 










250 










255 




Thr 


Glu 


Gin 


Leu 
260 


Glu 


Leu 


Asp 


He 


Gly 
265 


Ala 


Phe 


Phe 


He 


Ala 
270 


He 


Gly 


His 


Thr 


Pro 
275 


Asn 


Thr 


Ser 


Phe 


Leu 
280 


Ala 


Asp 


Thr 


Gly 


Val 
285 


val 


Cys 


Asp 


Asp 


Ala 


Gly 


Tyr 


Val 


Gin 


Thr 


val 


Gly Gly Ala Gly Gly 


Gly 


Gin 


Thr 




290 










295 










300 










Lys 


Thr 


Asp 


Val 


Thr Gly 


Val 


Phe Gly Ala Gly Asp Val 


Val 


Asp 


Tyr 


305 










310 










315 










320 


His 


Tyr 


Gin 


Gin 


Ala 
325 


Val 


Thr 


Ala 


Ala 


Gly 
330 


Met 


Gly 


Ser 


Lys 


Ala 
335 


Ala 


lie 


Asp 


Ala 


Asp 


Glu 


Tyr 


Leu 


Glu 


Ser 


Val 


Ala Asp 


Gly 


Val 


Thr 


Gly 








340 










345 










350 






Glu 


Thr 


Ala 
355 


Asp 


Ala 


Thr 


Pro 


Ala 
360 


Asp 


Asp 















<210> 248 
<211> 294 
<212> PRT 

<213> Halobacterium 
<400> 248 

Met Pro Thr Gin Asp Gly Glu Arg 

1 5 
Gly Pro Ala Gly Cys Ala Ala Gly 

20 

Asp Thr Val Val Phe Asp Arg Gly 

35 40 
Phe Val Glu Asn Tyr Pro Gly Phe 

50 55 
Leu Arg Gly Leu Phe His Asp His 



Arg Asp Val Val He Val Gly Gly 

10 15 
Val Phe Thr Ala Arg Tyr Gly Leu 
25 30 
Asn Ala Ala Leu Pro Arg Cys Ala 

45 

Pro Gly Gly He Asp Val Pro Thr 

60 

Ala Glu Thr Ala Gly Cys Asp Leu 
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65 








lie 


Ala 


Asp 


Thr 


Gly 


Phe 


Val 


Val 








inn 
1UU 


Val 


Leu 


Ala 


Ala 






Tin 

lib 




Gly 


A.sp 


Ser 


Ala 










Arg 


Phe 


Asp 


Asp 


145 








Gly 


Leu 


Tyr 


Val 


lie 


Ala 


Ala 


Gly 








18 0 


Arg 


Lys 


Arg 


Ala 






195 




Trp 


Lys 


Arg 


Arg 




210 






Arg 


Trp 


Arg 


Glu 


225 








Asp 


Asp 


Asp 


Glu 


Phe 


Asp 


Ala 


Thr 








260 


Gly 


Ala 


His 


Arg 






275 




Tyr 


Arg 


Glu 


Gin 



290 



70 

Val Glu Ser Val 
85 

Glu Thr Gin Asp 

Ala Trp Tyr Asp 

120 

Phe Glu Thr His 
135 

Ala Tyr Ala Asp 
150 

Ala Ser Pro Gly 
165 

Asn Gly Ala His 

Arg Gly Tyr Pro 

20 0 

Glu Ser Asp Leu 
215 

Trp Phe Ala Ala 
230 

Phe Ala Ala Leu 
245 

Leu Ser Ala Asp 

Leu Leu Asp His 

280 

Arg Asp 



75 

Asp Arg Pro Ser 
90 

Gly Arg Arg Val 
105 

Gly Ser Tyr Leu 

Asp His His Gly 

140 

Ala Asp Gly Arg 
155 

Gly Gin Arg Ser 
170 

Val Ala Arg Cys 
185 

Glu Gly Val Ala 

Ser Gly Glu Trp 

220 

Glu Ala Gly Asp 
235 

Arg Ala Ala His 
250 

Ala He Glu Glu 
265 

lie Asp Asp Asp 



80 

Asp Asp Asp Thr 
95 

Tyr Thr Asp Thr 
110 

Arg Pro Val Val 
125 

Glu Ser Arg Glu 

Thr Pro Val Asp 

160 

Ala Gin Ala Val 
175 

Leu Leu Ala Asp 
190 

Pro His Tyr Asp 
205 

Ala Asp Arg Asp 

Asp Hi s Asp Leu 

240 

Leu Asp Arg Thr 
255 

Arg Ala Glu Ala 
270 

His lie Glu Ser 
285 



<210> 249 

<211> 324 

<212> PRT 

<213> Helicobacter pylori 



<400> 249 



Met 


Asn 


Gin 


Glu 


He 


Leu 


Asp 


Val 


Leu 


He 


Val 


Gly Ala 


Gly 


Pro 


Gly 


1 








5 










10 










15 




Gly 


He 


Ala 


Thr 


Ala 


Val 


Glu 


Cys 


Glu 


He 


Ala 


Gly 


Val 


Lys 


Lys 


Val 






20 










25 










30 






Leu 


Leu 


Cys 


Glu 


Lys 


Thr 


Glu 


Ser 


His 


Ser 


Gly 


Met 


Leu 


Glu 


Lys 


Phe 






35 










40 










45 








Tyr 


Lys 


Ala 


Gly Lys 


Arg 


He Asp 


Lys 


Asp 


Tyr 


Lys 


Lys 


Gin 


Val 


Val 


50 










55 










60 










Glu 


Leu 


Lys 


Gly 


His 


He 


Pro 


Phe 


Lys 


Asp 


Ser 


Phe 


Lys 


Glu 


Glu 


Thr 


65 








70 










75 










80 


Leu 


Glu 


Asn 


Phe 


Thr 


Asn 


Leu 


Leu 


Lys 


Glu 


His 


His 


He 


Thr 


Pro 


Ser 










85 










90 










95 




Tyr 


Lys 


Thr 


Asp 


He 


Glu 


Ser 


Val 


Lys 


Lys 


Glu 


Gly 


Glu 


Tyr 


Phe 


Lys 








100 










105 










110 






He 


Thr 


Thr 


Thr 


Ser 


Asn 


Thr 


Thr 


Tyr 


His 


Ala 


Lys 


Phe 


Val 


Val 


Val 






115 










120 










125 








Ala 


He 


Gly 


Lys 


Met 


Gly 


Gin 


Pro 


Asn 


Arg 


Pro 


Thr 


Ala 


Tyr 


Lys 


He 




130 








135 










140 










Pro 


Val 


Ala 


Leu 


Ser 


Lys 


Gin 


Val 


Val 


Phe 


Ser 


He 


Asn 


Asp 


Cys 


Lys 


145 










150 










155 










160 


Glu 


Asn 


Glu 


Lys 


Thr 


Leu 


Val 


He 


Gly 


Gly 


Gly Asn 


Ser 


Ala 


Val 


Glu 






165 










170 










175 




Tyr 


Ala 


He 


Ala 


Leu 


Cys 


Lys 


Thr 


Thr 


Pro 


Thr 


Thr 


Leu 


Asn 


Tyr 


Arg 






180 








185 










190 






Lys 


Lys 


Glu 


Phe 


Ser 


Arg 


He 


Asn 


Glu Asp 


Asn 


Ala 


Lys 


Asn 


Leu 


Gin 


195 










200 










205 








Glu 


Val 


Leu 


Asn 


Asn 


Asn 


Thr 


Leu 


Lys 


Ser 


Lys 


Leu Gly Val Asp 


He 




210 










215 










220 










Glu 


Ser 


Leu 


Glu 


Glu 


Asp 


Asn 


Thr 


Gin 


He 


Lys 


Val 


Asn 


Phe 


Thr 


Asp 


225 










230 










235 










240 
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Asn 


Thr 


Ser 


Glu 


Thr 


Pro 


Leu 


Glu 








260 


Thr 


Asn 


He 


Pro 






275 




Asn 


Leu 


Phe 


He 




290 






Ala 


Thr 


Ala 


Leu 


305 








Arg 


Leu 


His 


Ser 



Ser Phe Asp Arg 
245 ■ 

Phe Phe Lys Arg 

Val Val Lys Glu 

2B0 

Val Gly Asp He 
295 

Asn His Gly Tyr 
310 



Leu Leu Tyr Ala 
250 

Cys Ser Leu Glu 
265 

Asn Leu Glu Ser 

Leu Phe Lys Ser 

300 

Asp Val Ala He 
315 



He Gly Gly Ser 
255 

Leu Asp Pro Ser 
270 

Asn Asn He Pro 
285 

Gly Ala Ser He 

Glu He Ala Lys 

320 



<210> 250 

<211> 128 

<212> PRT 

<213> Klebsiella oxytoca 



<400> 250 




Met Gly 


Thr 


Ala 


1 






Ala Gly 


Tyr 


Thr 






20 


Val Leu 


He 


Thr 




35 




Glu Val 


Glu 


Asn 


50 






Leu Met 


Glu 


Arg 


65 






He Phe 


Asp 


His 


Leu Val 


Gly 


Asp 






100 


Tyr Arg 


Arg 


He 




115 





Lys His Ser Lys 
5 

Ala Ala Val Tyr 

Gly Met Glu Lys 

40 

Trp Pro Gly Asp 
55 

Met His Glu His 
70 

He Asn Ser Val 
85 

Ser Gly Glu Tyr 

Ser Ala Leu Ser 

120 



Leu Leu He Leu 
10 

Ala Ala Arg Ala 
25 

Gly Gly Gin Leu 

Pro Asn Asp Leu 

60 

Ala Thr Lys Phe 
75 

Asp Leu Gin Asn 
90 

Thr Cys Asp Ala 
105 

Gly Ser Ala He 



Gly Ser Gly Pro 
15 

Asn Leu Gin Pro 
30 

Thr Thr Thr Thr 
45 

Thr Gly Pro Leu 

Glu Thr Glu He 

80 

Arg Pro Phe Arg 
95 

Pro Asp Tyr Arg 
110 

Gly Arg Arg Val 
125 



<210> 251 
<211> 79 
<212> PRT 

<213> Lactococcus lactis 



<400> 251 
Met Gin Glu Leu 
1 

Tyr Ala Ala Phe 

2 0 

Glu Ser Ala Gin 
35 

Lys Leu He Tyr 
50 

Leu Thr Lys Asn 
65 



Asp Leu He He 
5 

Tyr Ala Gly Met 

Val Pro Gly Gly 

40 

Asp He Ala Gly 

55 

Leu Leu Glu Gin 
70 



Val Gly Ala Gly 
10 

Arg Gly Leu Ser 
25 

Gin Pro Gin Asn 

Leu Pro Ala Val 

60 

Leu Ala Gin He 
75 



Pro Val Gly Leu 
15 

Val Ala He He 
30 

Leu Tyr Pro Glu 
45 

Thr Gly Ala Asp 
Ser His Arg 



<210> 252 
<211> 321 
<212> PRT 

<213> Lactococcus lactis 
<400> 252 

Met Gin Glu Leu Asp Leu He He Val Gly Ala Gly Pro Val Gly Leu 

15 10 15 

Tyr Ala Ala Phe Tyr Ala Gly Met Arg Gly Leu Ser Val Ala He He 

20 25 30 

Glu Ser Ala Gin Val Pro Gly Gly Gin Pro Gin Asn Leu Tyr Pro Glu 
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35 










4 U 


Lys 


Leu 


lie 


Tyr 


Asp 


I le 


a._l a 


L»xy 




c n 
b U 














Leu 


Thr 


Lys 


Asn 


Leu 


Leu 


ulu 




O 3 








/ u 






Fne 


Leu 


Gly Glu 


Ser 


vaj. 


m t-i 
n 


T i ffi 

j->y s 










85 








Ser 


Val 


Thr 


Thr 


Asp 


Lys 




mr 








100 










Thr 


Thr 


Gly Ala Gly 


Leu 


Leu 


Lys 






115 










12 0 


Glu 


Glu 


Thr 


Leu 


Ala 


Asn 


Glu 


Gly 




130 










13 5 




Leu 


Lys 


Glu 


Phe 


Glu 


Gly 


Lys 


Asn 


145 










150 






Ser 


Ala 


Leu Asp 


Trp 


Ser 


Leu 


Met 










165 








•> • 
His 


Leu 


Val 


His 


Arg 


Arg 


Thr 


Ala 








180 










Asp 


Arg 


Val 


Met 


Asn 


Ser 


Asn 


Val 






195 










o n n 
z U U 


Ser 


Asn 


Leu 


lie 


Glu 


Asn 


GlU 


Leu 




<2 1 U 














Glu 


Ser 


Leu 


Asn 


Phe 


Ser 


lie 


Asp 


225 










230 






Leu 


Thr 


Asn 


Gin 


Val 


Thr 


Leu 


Ala 










245 








Gly 


Arg 


Val 


Lys 


Ala 


Asp 


Ser 


Met 








260 










Tyr 


Val 


Ala Gly Asp 


Ala 


Ser 


Asp 






275 










280 


Ser 


Val 


Gly 


Phe 


Gly 


Glu 


Ala 


Val 




2 9,0 










295 




Lys 


Leu 


Glu 


Phe 


Asp 


His 


Pro 


Leu 



305 310 
Phe 



<210> 253 
<211> 308 
<212> PRT 

<213> Lactococcus lactis 



<400> 253 



Met 


Thr 


Glu 


Lys 


Lys 


Tyr 


Asp 


Val 


1 








5 








Gly 


Met 


Thr 


Ala 


Ala 


Met 


Tyr 


Thr 








20 










Leu 


Leu 


Glu 


Arg 


Gly 


Val 


Pro 


Gly 






35 










40 


He 


Glu 


Asn 


Tyr 


Pro 


Gly 


Tyr 


Glu 




50 










55 




Met 


Lys 


Met 


Ala 


Glu 


Pro 


Leu 


Glu 


65 










70 






Gly 


Phe 


Val 


Thr 


Ala 


He 


Glu 


Asp 








85 








Thr 


Glu 


Asp 


Asp 


Glu 


Phe 


Val 


Thr 








100 










Ala 


Asn 


His 


Arg 


Lys 


Leu 


Glu 


He 






115 










120 


Arg 


Gly Val 


Ser 


Tyr 


Cys 


Ala 


Val 




130 










135 




Gin 


Glu 


He 


Leu 


Val 


lie 


Gly 


Gly 


145 










150 






Leu 


Tyr 


Leu 


Thr 


Arg 


Phe 


Gly 


Gin 



165 



45 





Pro 


Ala 


Val 

V -Am- 


Thr 


Gly Ala 


Asp 








d yj 










J—J ~ X 


Ala 


Gin 


He 


Ser 


His 




Leu 






/ -D 










80 


He 


Glu 


Lys 


Glu 


Glu Gly 


He 

m3m *k 


Phe 




90 










95 




Arg 


Arg 


Ala 


Lys 


Gly Val 


TiPU 


Leu 


105 










110 






Pro 


Arg 


Lys 


Leu 


Glv 


He 

>ii *A* >w 


A err") 


Asn 










125 








Lys 


He 


Ser 


Tyr 




j. J- 


Thr 


Ser 






140 










Val 


Ala 


Val 


Phe 


\si J. y 




m v 

VJ -!_ J 


Asn 






155 










160 

J- w W 


Leu 


Glu 


Lys 


Val 


A±a 


Lys 


lori 


Val 




170 










175 




Phe Arg Gly His 


Glu 


He 


T'Vl T* 

Xxl-L 


Val 


185 










190 






Gin 


Val 


His 


Thr 


Pro 


Tyr 


J- J.^ -L 


Phe 










205 








Glu 


Leu 


Lys 


Lys 


He 


Lys 


□ CJ. 


Glu 








220 










Lys 


He 


Leu 


Val 


Asn 


Tyr 


Gly 


Phe 




235 










240 


Glu 


Asn 


Leu 


Glu 


Val 


Ser 


Arg 


Asn 




250 










255 




Met 


Gin 


Ser 


Asn 


He 


Glu 


Gly 


Leu 


265 










270 






Tyr 


Pro Gly 


Lys 


Met 


Pro 


Leu 


Met 










285 








His 


Ala 


He 


Asn 


Ala 


Met 


Thr 


Lys 








300 










Arg 


Gly Gly 


His 


Ser 


Ser 


Ser 


He 






315 










320 



Val 


He 


He 


Gly 


Ser 


Gly 


Pro 


Ala 




10 










15 




Ala 


Arg 


Ser 


Glu 


Met 


Lys 


Thr 


Leu 


25 










30 






Gly Gin Met 


Asn 


Asn 


Thr 


Ala 


Glu 










45 








Thr 


He 


Met 


Gly 


Pro 


Glu 


Leu 


Ser 








60 










Gly 


Leu 


Gly 


Val 


Glu 


Asn 


Ala 


Tyr 




75 










80 


His 


Gly Asp 


Tyr 


Lys 


Lys 


He 


He 




90 










95 




Lys 


Ser 


He 


He 


He 


Ala 


Thr 


Gly 


105 










110 






Pro 


Gly 


Glu 


Glu 


Glu 


Tyr 


Gly 


Ala 








125 








Cya 


Asp Gly Ala 


Phe 


Phe 


Arg 


Asn 








14 0 










Gly Asp 


Ser 


Ala 


Val 


Glu 


Glu 


Ala 






155 










160 


Ser 


Val 


Thr 


He 


Met 


His 


Arg 


Arg 




170 










175 
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Asp 


Lys 


Leu 


Arg 








180 


Glu 


Lys 


He 


Asn 




1 Q C 

J. y d 




Asp 


Asp 


Lys 


Lys 




A J-U 






Glu 


Val 


Thr 


Glu 


225 








Asp 


Pro 


Val 


Ala 


Gly 


Trp 


He 


He 








260 


Phe 


Ala 


Val 


Gly 






275 




Ala 


He 


Gly 


Asp 




290 






Ala 


Glu 


Leu 


Asp 



305 



Ala Gin Glu He 

Phe He Trp Asp 

200 

Val Gin Ser Val 
215 

Lys Ala Phe Gly 
230 

Glu Phe Ala Gly 
245 

Thr Asp Asp His 

Asp Val Arg Gin 

280 

Gly Ala Gin Ala 
295 



He Gin Gin Arg 
185 

Ser Val Pro Met 

Val Tyr Lys Asn 

220 

Gly He Phe He 
235 

Asn Leu Gly He 
250 

Met Arg Thr Ser 
265 

Lys Asp Phe Arg 

Ala Gin Glu Ala 

300 



Ala Phe Lys Glu 
190 

Glu He Lys Gly 
205 

val Lys Thr Gly 

Tyr Val Gly Leu 

240 

Thr Asp Glu Ala 
255 

Leu Pro Gly He 
270 

Gin He Thr Thr 
285 

Tyr Lys Phe Val 



<210> 254 
<211> 44 
<212> PRT 

<213> Lactococcus lactis 
<400> 254 

Met Gin Glu Leu Asp Leu He He 

1 5 
Tyr Ala Ala Phe Tyr Ala Gly Met 

20 

Glu Ser Ala Gin Val Pro Gly Gly 
35 40 



Val Gly Ala Gly Pro Val Gly Leu 

10 15 
Arg Gly Leu Ser Val Ala He He 
25 30 
Gin Pro Gin Asn 



<210> 255 
<211> 339 
<212> PRT 

<213> Listeria monocytogenes 



<400> 255 
Glu Phe Tyr Ser 
1 

Asp Ser Ala Ser 

20 

Pro Asn Phe Trp 
35 

Gin Leu Lys Glu 
50 

Arg Asn Pro Gly 
65 

Phe Tyr Lys Gly 

Arg Asp Val Leu 

100 

He His Ala Met 
115 

Gin Val Ala Asp 
13 0 

Arg Ala Lys Ser 
145 

Phe Trp His Ser 

Pro Trp Glu Gin 

180 

Glu Met Leu Gin 
195 

Ala Leu Thr He 



Tyr 


Lys 


Lys 


Glu 


5 








Ala 


Cys 


Asp 


He 


Pro 


Ala 


Tyr 


Asn 








40 


Glu 


Glu 


Gly 


Val 






55 




Asn 


Leu 


Leu 


Gly 




70 






Asn 


Arg 


Lys 


Glu 


85 








Pro 


Val 


Leu 


Ala 


Met 


Gly 


Glu 


Tyr 








120 


Leu 


Glu 


Val 


Thr 






135 




Ser 


Tyr 


Tyr 


Leu 




150 






Phe 


Leu 


Glu 


Cys 


165 








Glu 


Pro 


Asp 


Leu 


Glu 


Glu 


Asp 


Asp 








200 


Ser 


Gly 


Asn Arg 



He Asn Arg Tyr 
10 

Leu Arg Lys Val 
25 

Gin Leu Ala Ser 

Arg Val Leu Ser 

60 

He Cys Asp Leu 
75 

Ala Asp Glu Leu 
90 

His His Lys Glu 
105 

Glu Glu Ala Asp 

Glu Arg Ser Lys 

140 

Gly Asp Val Glu 
155 

Asp Leu Tyr Glu 
170 

Thr Asn Asp Thr 
185 

Leu Thr His Met 
Pro Glu Leu Val 



Leu Ala Glu Glu 
15 

He Asp Glu Lys 
30 

Leu Tyr Phe Glu 
45 

Asp Leu Leu Ser 

Phe He Tyr His 

80 

Tyr Leu Glu Leu 
95 

Lys Leu Gly Leu 
110 

Asp Leu Leu Glu 
125 

Tyr Tyr Tyr Phe 

Gly Ala Lys Met 

160 

Asp Val Arg Phe 

17 5 

Arg Leu Val Leu 
190 

Leu Gly Val Tyr 
205 

Leu Phe His Pro 
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210 




















4S dZ \J 






Phe 


Thr 


Leu 


Leu 


Asp 


Met 


Ser 


ASp 


**Py 

x jrp 


Ser 


Tyr 


Met 


\JJL U. 


Hi c; 
ma 




Met 


O O fT 

225 








230 










*s J o 










240 


Asn 


Pne 


Asp 


Tyr 


T)Vl a 

irlLG. 


IT J-\J 


7\ nri 

/\sp 


Gly Ala 


lie 




Gin 

w ±11 


Asn 


Gly Tyr 


Leu 






245 










250 










255 




lie 


Ala 


Lys 


Ala 


Met 


lie 


lie 


Leu 


Lys 


Glu 


Asn 


Gly 


lie 


Leu 


Leu 


Asn 






260 










265 










270 






Glu 


Glu 


Tyr 


Met 


Ala 


Leu 


Tyr 


Lys 


Gin 


Met 


Phe 


Ser 


Leu 


Val 


Leu 


He 






275 










280 










285 






Val 


Asp 


Ala 


Gly 


Lys 


Asp 


Leu 


lie 


Leu 


Gly Arg 


Tyr 


Thr 


He 


Glu 


Thr 


290 








295 










300 










Ala 


Ser 


Ala 


lie 


Ala 


Lys 


Leu 


Phe 


Leu 


Pro 


His 


Leu 


Lys 


Leu 


Gin 


Leu 


305 










310 










315 










320 


Val 


Glu 


Glu 


Phe 


Glu 


Cys 


Ser 


Lys 


Cys 


Ala 


Arg 


Asp 


He 


Glu 


Arg 


Val 










325 






330 










335 




Leu 


Ser 


Arg 





























<210> 256 
<211> 303 
<212> PRT 

<213> Methanothermobacter thermautotrophicus 
<400> 256 



Met 


Met 


inr 


Asp 


iyr 


ASp 




He 


Val 


He 


Gly Ala 


Gly 


Pro 


Ala 


Gly 


1 






c 










10 










15 




Leu 


Thr 


Ala 


Gly 


He 


Tyr 


Gly 


Gly Arg 


Gin 


Gly 


Ser 


Ser 


Val 


Leu 


Met 








20 










25 










30 






Leu 


Asp 


Lys 


Gly 


Pro 


Ala 


Gly 


Gly 


Leu 


Gly 


Leu 


Glu 


val 


Pro 


Met 


Met 




35 










40 










45 






Thr 


Glu 


Asn 


Tyr 


Pro 


Gly 


Phe 


Glu 


Met 


He 


Ala Gly Met 


Ser 


Leu 


Val 




50 








55 










60 










Lys 


Met 


Lys 


Ly s 


Gin 


Ala 


Thr 


Ala 


Val 


Ala 


Glu 


Leu 


Arg 


Glu 


Met 


Glu 


65 






70 










75 










80 


Glu 


Val 


Lys 


Glu 


He 


Glu 


Lys 


Gly Asp 


Val 


Phe 


Thr 


Val 


Lys 


Thr 


Ser 








85 










90 










95 




Arg 


Asp 


Thr 


Tyr 


Thr 


Ala 


Ser 


Ala 


He 


He 


Phe 


Ala 


Thr 


Gly 


Ser 


Lys 




100 










105 










110 




Gly 


His 


Arg 


Gin 


Leu 


Gly 


Val 


Pro 


Gly 


Glu 


Asn Asp 


Leu 


Leu 


Gly 


Arg 




115 










120 










125 








Val 


Cys 


Tyr 


Cys 


Ala 


Thr 


Cys 


Asp 


Gly 


Pro 


Leu 


Tyr 


Lys 


Gly Arg 


Lys 




130 








135 










140 






Val 


Phe 


Val 


Leu 


Met 


Val 


Gly 


Gly 


Gly 


Asn 


Ser 


Ala 


Ala 


Gin 


Glu 


Ala 


145 








150 










155 










160 


Leu 


Lys 


Asn 


He 


Gly 


Cys 


Asp 


Val 


Ser 


He 


Val 


His 


Arg 


Arg 


Asp 


Glu 








165 










170 










175 




Leu 


Arg 


Ala 


Asp 


Lys 


Tyr 


Leu 


Gin Asp 


Lys 


Leu 


Arg 


Glu 


Met 


Glu 


He 






180 










185 










190 






Pro 


Val 


He 


Trp 


Asn 


Ser 


Val 


Val 


Lys 


Glu 


He 


Gly 


Gly 


Asp 


Glu 


Arg 






195 








200 










205 




Glu 


Thr 


Val 


Glu 


Glu 


Val 


He 


He 


His 


Asn 


Arg 


Val 


Thr 


Gly Arg 


Asp 




210 










215 










220 










Leu 


Lys 


Val 


Asp 


Gly 


Val 


Phe 


He 


Ala 


He 


Gly 


Glu 


Glu 


Pro 


Leu 


Asn 


225 






230 










235 










240 


Gin 


Leu 


Ala 


Val 


Asp 


Leu 


Gly 


Val 


Glu 


Val 


Asp 


Lys 


Gly 


Gly Tyr 


He 










245 








250 










255 




He 


Thr 


Asp 


Lys 


Phe 


Gin 


Arg 


Thr 


Asn 


Val 


Pro 


Leu 


Val 


Tyr 


Ala 


Ala 






260 










265 










270 






Gly Asp 


He 


Thr 


Gly 


Gly 


Leu 


Asn 


Gin 


Trp 


Val 


Thr 


Ala 


Cys 


Ala 


Glu 






275 










280 










285 








Gly Ala 


He 


Ala 


Ala 


Thr 


Tyr 


Ala 


Tyr 


Arg 


Glu 


He 


Gin 


Ser 


Tyr 






290 










295 










300 











<210> 257 
<211> 179 
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<212> PRT 

<213> Bacillus subtilis 



<400> 257 



Met Val lie 


Ser 


Gly 


Gly Gly Asp 


Thr 


Ala 


Val 


Asp Trp 


Ala 


Asn 


Glu 


1 








5 










10 








15 




Lieu 




Pro 


He 


Ala 


Ala 


Ser 


Val 


Thr 


Val 


Val 


His Arg 


Arg 


Glu 


GlU 








20 










25 








30 






Phe 


Gly 


Glv 


Met 


Glu 


Ser 


Ser 


Val 


Thr 


Lys 


Met 


Lys Gin 


Ser 


Ser 


Val 




35 










40 








45 








Arg 


Val 


Leu 


Thr 


Pro 


Tyr Arg 


Leu 


Glu 


Gin Leu Asn Gly 


Asp 


Glu 


Glu 


50 










55 










60 








Gly 


lie 


Lys 


Ser 


Val 


Thr 


Val 


Cys 


His 


Thr 


Glu 


Ser Gly 


Gin Arg 


Lys 


65 








70 










75 








80 


Asp 


lie 


Glu 


He 


Asp 


Glu 


Leu 


He 


He 


Asn 


Hi s 


Gly Phe 


Lys 


He 


Asp 








85 










90 








95 




Leu 


Gly 


Pro 


Met 


Met 


Glu 


Trp 


Glv 


Leu 


Glu 


He 


Glu Glu 


Gly Arg 


Val 






100 








105 








110 






Lys 


Ala 


Asp 


Arg 


• 

His 


Met 


Arg 


Thr 


Asn 


Leu 


Pro 


Gly Val 


Phe 


Val 


Ala 




115 










12 0 








125 








Gly 


Asp 


Ala 


Ala 


Phe 


Tyr 


Glu 


Ser 


Lys 


Leu 


Arq 


Leu He 


Ala 


Gly 


Gly 


130 










135 










140 








Phe 


Thr 


Glu 


Gly 


Pro 


Thr 


Ala 


Val 


Asn 


Ser 


Ala 


Lys Ala 


Tvr 


Leu 


Asp 


145 








150 










155 








160 


Pro 


Lys 


Ala 


Glu 


Asn 


Met 


Ala 


Met 


Tyr 


Ser 


Thr 


His His 


Lvs 


Lys 


Leu 








165 










170 








175 

-X- i >-J 




Val 


His 


Lys 


























<210> 258 


























<211> 307 


























<212> PRT 


























<213> Mycoplasma pulmonis 


















<400> 258 


























Met 


Ser 


Gin 


Asn 


Lys 


He 


Tyr 


Asp 


Val 


Ala 


He 


He Gly 


Ala 


Glv 


Pro 


1 








5 










10 








15 




Gly 


Ala 


Leu 


Thr 


Ala 


Ala 


lie 


Tyr 


Thr 


Ser 


Arg 


Gly Asn 


Leu 


Asp 


-■ 

Thr 






20 








25 








30 






Val 


Phe 


He 


Asp 


Asn 


Ala 


Ala 


Pro 


Gly Gly 


Lys 


Leu He 


Tyr 


Ala 


Ser 






35 








40 








45 








Lys 


lie 


Glu 


Asn 


Trp 


Pro 


Gly Asp 


Thr 


He 


Val 


Lys Gly 


Thr 


Asp 


Leu 


50 








55 










60 








Ala 


lie 


Arg 


Phe 


Phe 


Glu 


His 


Ala 


Gin 


Ala 


Phe 


Gly Ala 


Lys 


Tyr 


Glu 


65 








70 










75 








80 


Tyr 


Gly 


Lys 


Val 


Val 


Asp 


Leu 


He 


Asn 


He 


Lys 


Asp Asp 


Leu 


Lys 


Glu 




85 










90 








95 




Leu 


Val 


Leu 


Glu 


Asp 


Gly 


Lys 


Lys 


He 


Gin 


Ala 


Lys Ser 


Val 


He 


lie 








100 






105 








110 






Ala 


Ser 


Gly 


Met 


Val 


Ser 


Arg 


Lys 


Pro 


Arg 


Glu 


He Leu 


Asn 


Tyr 


Asp 






115 










120 








125 








Glu 


Phe 


Glu 


Asn 


Arg 


Gly 


Val 


Ser 


Tyr 


Cys 


Val 


He Cys 


Asp 


Gly 


Pro 




130 






135 










14 0 








Met 


Tyr Gly His 


Asn 


Pro 


Ala 


He 


He 


He Gly Gly Gly 


Asn 


Ser 


Ala 


145 










150 










155 








160 


Val 


Glu 


Glu 


Gly 


Thr 


Phe 


Leu 


Ser 


Ser 


He 


Ala 


Ser Lys 


Val 


Tyr 


Val 








165 










170 








175 




lie 


Val 


Arg 


Asp 


Ser 


Asp 


Phe 


He 


Ala 


Glu 


Lys 


Ala Leu 


Val 


Asn 


Asp 






180 










185 








190 






Leu 


Lys 


Ser 


Arg 


Lys 


Asn 


He 


Glu 


Val 


Leu 


Phe 


Asn Ala 


Ser 


Val 


Lys 




195 










200 








205 




His 




Glu 


Leu 


His 


Gly 


Lys 


Asp 


Ala 


Leu 


Glu 


Tyr 


Ala 


He Val 


Asn 


Asn 




210 










215 










220 








Gly 


Lys 


Glu 


Val 


Lys 


Leu 


Glu 


Val 


Ala 


Ser 


Leu 


Phe Pro 


Tyr 


He 


Gly 


225 






230 










235 








240 


Phe 


Leu 


Pro 


Ser 


Ala 


Glu 


Tyr Ala 


Lys 


Asn 


Ala 


Gly Val 


Leu 


Glu 


Pro 
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245 250 255 

Asn Gly Phe lie Lys Thr Asp Glu Phe Met Glu Thr Lys Val Pro Gly 

260 265 270 

lie Tyr Ala lie Gly Asp He Arg He Lys Asp He Arg Gin He Leu 

275 280 285 

Thr Ala Thr Ser Asp Gly Thr He Ala Gly Lys He Leu Thr Asn Arg 

290 295 300 

He Lys Lys 
305 



<210> 259 
























<211> 316 
























<212> PRT 
























<213> Neisseria 


meningitidis 
















<400> 259 




















Ala 


Glv 


1*1 ti L. 


Ser 


Gin 


His 


Arg 


Lys 


Leu He He 


Leu 


Gly 


Ser 


Gly Pro 


J. 








5 




10 












He 


Tyr 


Thr 


Ala 


Ala 


Val 


Tyr 


Ala Ala Arg 


Ala 


Asn 


Leu 


Asn 


Pro 


Val 






20 






25 










30 






He 


Thr 


Gly 


He 


Ala 


Gin 


Gly Gly Gin 


Leu 


Met 


Thr 


Thr 


Thr 


Glu 


Val 






35 








40 








45 








7\ «i t"~\ 


Asn 


Trp 


Pro 


Ala 


Asp 


Ala Asp Gly 


Val 


Gin 


Gly 


Thr 


Glu 




Met 


50 








55 






60 










^ J. ct 


Arg 


Phe 


Leu 


Ala 


His 


Ala Glu Arg 


Phe 


Gly 


Thr 


Glu 


He 


He 

Jk <Jb *w 


Phe 


b b 








70 






75 










80 


nop 


Gin 


He 


Asn 


Ala 


Val 


Asp Leu Gin 


Lys 


Arg 


Pro 


Phe 


Thr 




Lvs 








85 






90 










q c: 

_7 -J 




Gly 


Asp 


Met 


Gly 


Glu 


Tyr 


Thr Cys Asp 


Ala 


Leu 


He 


Val 


Ala 


Thr 

J- Hi- 


Glv 




100 






105 










110 






Ala 


Ser 


Ala 


Lys 


Tyr 


Leu 


Gly Leu Pro 


Ser 


Glu 


Glu 


Ala 


Phe 




Glv 






115 




12 0 








125 








Lys 


Gly Val 


Ser 


Ala 


Cys 


Ala Thr Cys 


Asp 


Gly 


Phe 


Phe 


Tyr 




Asn 


130 










135 






140 








Ala 


Gin 


Asp 


Val 


Ala 


Val 


Val 


Gly Gly Gly Asn Thr Ala Val 


Glu 


Glu 


145 








150 






155 










160 


Leu' 


Tyr 


Leu 


Ala 


Asn 


He 


Ala Lys Thr 


Val 


Thr 


Leu 


He 


His 


Arg 


Arg 








165 




170 










175 




Ser 


Glu 


Phe 


Arg 


Ala 


Glu 


Lys He Met 


He 


Asp 


Lys 


Leu 


Met 


Lys 


Arg 








180 






185 










190 






Val 


Glu Glu Gly 


Lys 


He 


He Leu Lys 


Leu 


Glu 


Ser 


Asn 


Leu 


Gin 


Glu 






195 








200 








205 








Val 


Leu 


Gly Asp 


Asp 


Arg 


Gly Val Asn Gly Ala 


Leu 


Leu 


Lys 


Asn 


Asn 




210 










215 






220 








He 


Asp 


Gly 


Ser 


Glu 


Gin 


Gin 


He Ala Val 


Ser 


Gly 


He 


Phe 


He 


Ala 


225 








230 






235 










240 


Gly 


His 


Lys 


Pro 


Asn 


Thr 


Asp He Phe 


Lys 


Gly Gin 


Leu 


Glu 


Met 


Asp 






245 




250 










255 




Glu 


Ala 


Gly 


Tyr 


Leu 


Lys 


Thr Lys Gly 


Gly 


Thr 


Ala 


Asp 


Asn 


Val 


Gly 






260 






265 










270 




His 


Ala 


Thr 


Asn 


He 


Glu Gly 


Val Trp Ala 


Ala 


Gly Asp 


Val 


Lys 


Asp 






275 








280 








285 








Thr 


Tyr 


Arg 


Gin 


Ala 


He 


Thr Ser Ala 


Ala 


Ser Gly 


Cys 


Gin 


Ala 


Ala 




290 








295 






300 










Leu 


Asp 


Ala 


Glu 


Arg Trp 


Leu Gly Ser 


Gin 


Asn 


He 










305 








310 






315 













<210> 260 
<211> 316 
<212> PRT 

<213> Neisseria meningitidis 
<400> 260 

Met Ser Gin His Arg Lys Leu He He Leu Gly Ser Gly Pro Ala Gly 
1 5 10 15 
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Tyr Thr Ala Ala 

20 

He Thr Gly He 
35 

Asp Asn Trp Pro 
50 

Ala Arg Phe Leu 
65 

Asp Gin He Asn 

Gly Asp Met Gly 

100 

Ala Ser Ala Lys 
115 

Lys Gly Val Ser 
13 0 

Gin Asp Val Ala 
145 

Leu Tyr Leu Ala 

Ser Glu Phe Arg 

180 

Val Glu Glu Gly 
195 

Val Leu Gly Asp 
210 

Asp Gly Ser Glu 
225 

Gly His Lys Pro 

Glu Ala Gly Tyr 

260 

Ala Thr Asn He 
275 

Thr Tyr Arg Gin 
290 

Leu Asp Ala Glu 
305 



Val Tyr Ala Ala 

Ala Gin Gly Gly 

40 

Ala Asp Ala Asp 
55 

Ala His Ala Glu 
70 

Ala Val Asp Leu 
85 

Glu Tyr Thr Cys 

Tyr Leu Gly Leu 

120 

Ala Cys Ala Thr 
135 

Val Val Gly Gly 
150 

Asn He Ala Lys 
165 

Ala Glu Lys He 

Lys lie He Leu 

200 

Asp Arg Gly Val 
215 

Gin Gin He Ala 
230 

Asn Thr Asp He 
245 

Leu Lys Thr Lys 

Glu Gly Val Trp 

280 

Ala He Thr Ser 
295 

Arg Trp Leu Gly 
310 



Arg Ala Asn Leu 
25 

Gin Leu Met Thr 

Gly Val Gin Gly 

60 

Arg Phe Gly Thr 
75 

Gin Lys Arg Pro 
90 

Asp Ala Leu He 
105 

Pro Ser Glu Glu 

Cys Asp Gly Phe 

140 

Gly Asn Thr Ala 
155 

Thr Val Thr Leu 
170 

Met He Asp Lys 
185 

Lys Leu Glu Ser 

Asn Gly Ala Leu 

220 

Val Ser Gly He 
235 

Phe Lys Gly Gin 
250 

Gly Gly Thr Ala 
265 

Ala Ala Gly Asp 

Ala Ala Ser Gly 

300 

Ser Gin Asn He 
315 



Asn Pro Val He 
30 

Thr Thr Glu Val 
45 

Pro Glu Leu Met 

Glu He He Phe 

80 

Phe Thr Leu Lys 
95 

Val Ala Thr Gly 
110 

Ala Phe Ala Gly 
125 

Phe Tyr Lys Asn 

Val Glu Glu Ala 

160 

He His Arg Arg 
175 

Leu Met Lys Arg 
190 

Asn Leu Gin Glu 
205 

Leu Lys Asn Asn 

Phe He Ala He 

240 

Leu Glu Met Asp 
255 

Asp Asn Val Gly 
270 

Val Lys Asp His 
285 

Cys Gin Ala Ala 



<210> 261 
<211> 316 
<212> PRT 

<213> Pseudomonas aeruginosa 



<400> 261 



Met 


Ser 


Glu 


Val 


Lys 


His 


Ser 


Arg 


1 








5 








Ala 


Gly 


Tyr 


Thr 


Ala 


Ala 


Val 


Tyr 








20 










Val 


Val 


He 


Thr 


Gly 


lie 


Gin 


Pro 






35 










40 


Glu 


Val 


Asp 


Asn 


Trp 


Pro 


Gly 


Asp 




50 










55 




Leu 


Met 


Thr 


Arg 


Met 


Gin 


Gin 


His 


65 










70 






Val 


Tyr 


Asp 


His 


He 
85 


Hie 


Thr 


Ala 


Leu 


Lys 


Gly Asp 


Ser 


Gly 


Thr 


Tyr 








100 










Thr 


Gly Ala 


Ser 


Ala 


Gin 


Tyr 


Leu 






115 










12 0 


Met 


Gly Lys 


Gly 


Val 


Ser 


Ala 


Cys 




130 










135 




Arg 


Asn 


Gin 


Val 


Val 


Cys 


Val 


Val 


145 










150 






Glu 


Ala 


Leu 


Tyr 


Leu 


Ala 


Asn 


He 



Leu He He Leu Gly Ser Gly Pro 

10 15 
Ala Ala Arg Ala Asn Leu Lys Pro 
25 30 
Gly Gly Gin Leu Thr Thr Thr Thr 

45 

Val Glu Gly Leu Thr Gly Pro Ala 

60 

Ala Glu Arg Phe Asp Thr Glu He 

75 80 
Glu Leu Gin Gin Arg Pro Phe Thr 

90 95 
Thr Cys Asp Ala Leu He He Ala 
105 HO 
Gly Met Ser Ser Glu Glu Ala Phe 

125 

Ala Thr Cys Asp Gly Phe Phe Tyr 

140 

Gly Gly Gly Asn Thr Ala Val Glu 
155 160 
Ala Lys Glu Val His Leu He His 
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165 






170 




175 








/vsp 


T cs 
LiyS 


Leu 


Arg Ser 


Glu 


Lys lie 


T.£»n Gin 


Asn Lvs Leu 


Phe 


-LOU 








185 




190 




ASp 


JUy S 




uj.ll 


Asn 


Gly Asn 


Val 


His Leu 


Hi s TrD 


Asn Thr Thr 


Leu 


JL ^ -J 








200 






205 






m t» 


V dX 




Gly Asp Ala 


Ser 


Gly Val 


Thr Gly 


Val Arg Leu 


Lys 


5i n 

«5 x u 








215 






220 






O /^N 


xnr 


x j.e 


i\Bp 


Gly 


Ser Thr 


Ser 


Glu Leu 


Ser Leu 


Ala Gly Val 


Phe 


225 






230 






235 




240 


lie 


Ala 


lie 


Gly 


His 


Lys Pro 


Asn Thr Asp 


Leu Phe 


Gin Gly Gin 


Leu 








245 




250 




255 




Glu 


Met 


Arg 


Asp 


Gly 


Tyr Leu 


Arg 


He His 


Gly Gly 


Ser Glu Gly 


Asn 






260 








265 




270 




Ala 


Thr 


Gin 


Thr 


Ser 


He Glu 


Gly 


Val Phe 


Ala Ala 


Gly Asp Val 


Ala 






275 








280 






285 




Asp 


His 


Val 


Tyr 


Arg 


Gin Ala 


He 


Thr Ser 


Ala Gly Ala Gly Cys 


Met 


290 




295 






300 






Ala 


Ala 


Leu 


Asp 


Ala 


Glu Lys 


Tyr 


Leu Asp 


Asp His 






305 










310 






315 







<210> 262 
<211> 316 
<212> PRT 

<213> Pseudomonas aeruginosa 
<400> 262 

Met Pro Asp Thr Leu Arg His Ala Arg Val lie He Leu Gly Ser Gly 

IS 10 15 

Pro Ala Gly Tyr Ser Ala Ala Val Tyr Ala Ala Arg Ala Asn Leu Lys 

20 25 30 

Pro Leu Leu He Thr Gly Met Gin Ala Gly Gly Gin Leu Thr Thr Thr 

35 40 45 

Thr Glu Val Asp Asn Trp Pro Gly Asp Pro His Gly Leu Thr Gly Pro 

50 55 60 

Ala Leu Met Gin Arg Met Gin Glu His Ala Glu Arg Phe Glu Thr Glu 
65 70 75 80 

He Val Phe Asp His He His Ala Val Asp Leu Ala Gly Lys Pro Phe 

85 90 95 

Thr Leu Arg Gly Asp Asn Gly Thr Tyr Thr Cys Asp Ala Leu He Val 

100 105 HO 

Ala Thr Gly Ala Ser Ala Arg Tyr Leu Gly Leu Pro Ser Glu Gin Ala 

115 " 120 125 

Phe Met Gly Lys Gly Val Ser Ala Cys Ala Thr Cys Asp Gly Phe Phe 

130 * 135 140 

Tyr Arg Asn Arg Glu Val Ala Val He Gly Gly Gly Asn Thr Ala Val 
145 " 150 155 160 

Glu Glu Ala Leu Tyr Leu Ala Asn He Ala Ser Arg Val Thr Leu Val 

165 170 175 

His Arg Arg Glu Thr Phe Arg Ala Glu Lys He Leu Gin Asp Lys Leu 

180 185 190 

Gin Ala Arg Val Ala Glu Gly Lys He Val Leu Lys Leu Asn Ala Glu 

195 200 205 

Val Asp Glu Val Leu Gly Asp Thr Met Gly Val Thr Gly Val Arg Leu 

210 215 220 

Lys Thr Arg Asp Gly Gly Ser Glu Glu He Ala Val Asp Gly Met Phe 
225 230 235 240 

Val Ala He Gly His Thr Pro Asn Thr Ser Leu Phe Glu Gly Gin Leu 

245 . 250 255 

Ala Leu Lys Asp Gly Tyr Leu Val Val Asn Gly Gly Arg Glu Gly Asn 

260 265 270 

Ala Thr Ala Thr Asn Val Pro Gly Val Phe Ala Ala Gly Asp Val Ala 

275 280 285 

Asp His Val Tyr Arg Gin Ala He Thr Ser Ala Gly Ala Gly Cys Met 

290 295 300 

Ala Ala Leu Asp Val Glu Arg Tyr Leu Asp Ser Leu 
305 310 315 
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<210> 263 
<211> 345 
<212> PRT 

<213> Pyrococcus abyssi 



<400> 263 
Met Leu Leu Asn 
1 

Phe Ser Leu Gly 

20 

Trp Asp Val He 
35 

He Tyr Ala Ala 
50 

Leu Gly Gly Asn 
65 

Phe Pro Glu Gly 

His Val Lys Lys 

100 

He Asp Pro Ala 
115 

Val Lys Thr Ala 
130 

Ala Val Gly Ala 
145 

Phe Thr Gly Arg 

Phe Val Gly Lys 

180 

Gin Glu Ala Leu 
195 

His Arg Arg Asp 
210 

Lys Gin Ala Gly 
225 

Arg Gly Thr Asn 

Gly Glu Thr Phe 

260 

Tyr Glu Pro Lys 
275 

Tyr Gly Tyr He 
290 

He Phe Ala Ala 
305 

Ala Val Gly Gin 

Glu Ser Trp Asn 

340 



He His Gin Glu 
5 

Gly Leu Gly Lys 

He He Gly Ala 

40 

Arg Phe Gly Leu 
55 

Met Ala He Thr 
70 

He Ser Gly Ser 
85 

Tyr Gly Val Asp 

Glu Cys Ala Tyr 

120 

Asn Gly Lys Glu 
135 

Glu Pro Arg Lys 
150 

Gly Val Ser Tyr 
165 

Glu Val He Val 

Tyr Leu His Ser 

200 

Lys Phe Arg Ala. 
215 

He Pro Thr He 
230 

Lys Val Glu Ser 
245 

Glu Lys Lys Val 

Thr Asp Phe Val 

280 

Lys Val Asp Met 
295 

Gly Asp He Thr 
310 

Gly Ala He Ala 
325 

Gly Lys Ser He 



Ser Tyr Val Glu 
10 

Ser Arg Val Asp 
25 

Gly Pro Ala Gly 

Asp Thr He He 

60 

Asp Leu He Glu 
75 

Glu Leu Ala Lys 
90 

Val He Phe Asp 
105 

Tyr Glu Gly Pro 

Tyr Lys Gly Lys 

140 

Leu His Val Pro 
155 

Cys Ala Thr Cys 
170 

Val Gly Gly Gly 
185 

He Gly Val Lys 

Asp Lys He Leu 

220 

Leu Asn Thr Val 
235 

Val Val Leu Lys 
250 

Asp Gly Val Phe 
265 

Lys His Leu Gly 

Tyr Met Arg Thr 

300 

Asn Val Phe Lys 
315 

Ala Asn Ser Ala 
330 

Glu 
345 



Val Val Lys Met 
15 

Glu Ser Lys Val 
30 

Tyr Thr Ala Ala 
45 

He Thr Lys Asp 

Asn Tyr Pro Gly 

80 

Arg Met Tyr Glu 
95 

Glu Val Val Arg 
110 

Cys Gin Phe Glu 
125 

Thr He He He 

Gly Glu Lys Glu 

160 

Asp Gly Pro Leu 
175 

Asn Thr Ala Leu 
190 

Val Thr Leu Val 
205 

Gin Asp Arg Leu 

Val Thr Glu He 

240 

Asn Val Lys Thr 
255 

He Phe He Gly 
270 

He Thr Asp Glu 
285 

Lys Val Pro Gly 

Gin He Ala Val 

320 

Lys Glu Phe He 
33 5 



<210> 264 
<211> 334 
<212> PRT 
<213> Rickettsia 



prowazekii 



<400> 264 
Met Tyr Asn Thr 
1 

Phe Ala Val Phe 

20 

Asp Ala Gin Glu 
35 

Lys His He Tyr 
50 



Asp He. Val He 
5 

Gin Ala Gly Met 

Val He Gly Gly 

40 

Asp He Pro Ala 
55 



He Gly Ser Gly 
10 

Leu Gly Met Lys 
25 

Gin Cys He Thr 

Tyr Pro Lys He 

60 



Pro Val Gly Leu 
15 

Cys His Val He 
30 

Leu Tyr Pro Glu 
45 

Ala Ala Lys Glu 
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Leu He Lys Gin 
65 

His Leu Asn Gin 

Glu He Lys Thr 

100 

He Ala Ala Gly 
115 

Asn He Glu Ala 
130 

Lys Ser Lys Phe 
145 

Ser Ala Val Asp 

Tyr Leu Val His 

180 

Arg Gin Leu Arg 
195 

Gly Tyr Gin Leu 
210 

Val He Val Lys 
225 

He Leu Leu Pro 

Asn Trp Gly Phe 

260 

Tyr Tyr Gin Thr 
275 

His Tyr Val Gly 
290 

Ala Cys Ser Leu 
305 

Leu His Phe Glu 



Leu Glu Ser Gin 
70 

Gin Ala Thr Glu 
85 

Ser Lys Asn Thr 

Ala Gly Ala Phe 

120 

Phe Glu Gly Lys 
135 

Leu Gly Lys Asn 
150 

Trp Ala He Thr 
165 

Arg Arg Asp Lys 

His He Ala Glu 

200 

Asn Asn Leu Asp 
215 

Asp Leu Gin Asn 
230 

Phe Phe Gly Leu 
245 

Asn Val Arg Leu 

Asn He Lys Gly 

280 

Lys Leu Lys Leu 
295 

His His Ala Tyr 
310 

Tyr Ser Thr Asn 
325 



Ala Ala Pro Phe 

75 

Leu Asn Lys His 
90 

Leu He Lys Ser 
105 

Gly Pro Asn Lys 

Ser He Phe Tyr 

140 

He Val Val Ala 
155 

Leu Ser Glu He 
170 

Phe Thr Ala Ala 
185 

Thr Gly Lys He 

Gly His Asn Ser 

220 

Asn He Arg Lys 
235 

Lys Gin Asp Leu 
250 

Gin His He Glu 
265 

He Tyr Ala He 

He He Thr Gly 

300 

Ser Arg Val Phe 
315 

Lys Tyr Glu Gin 
330 



Asn Pro Val Tyr 

80 

Asp Asp Phe Phe 
95 

Lys Val He He 
110 

Pro Pro He Ala 
125 

Phe He Asn Asp 

Gly Gly Gly Asp 

160 

Ala Asn Lys He 
175 

Thr Glu Ser Val 
190 

Glu Leu Val Thr 
205 

Glu Leu Arg Ser 

Leu Asp Ala Asn 

240 

Gly Pro Leu Ala 
255 

Val Asp Asn Tyr 
270 

Gly Asp Val Ala 
2 85 

Phe Ala Glu Ala 

Asp Gly Lys Ala 

320 

Lys Gin 



<210> 265 
<211> 311 
<212> PRT 

<213> Staphylococcus aureus 
<400> 265 

Met Thr Glu He Asp Phe Asp He Ala He He Gly Ala Gly Pro Ala 

1 5 10 15 

Gly Met Thr Ala Ala Val Tyr Ala Ser Arg Ala Asn Leu Lys Thr Val 

20 25 30 

Met He Glu Arg Gly He Pro Gly Gly Gin Met Ala Asn Thr Glu Glu 

35 40 45 

Val Glu Asn Phe Pro Gly Phe Glu Met He Thr Gly Pro Asp Leu Ser 

50 55 60 

Thr Lys Met Phe Glu His Ala Lys Lys Phe Gly Ala Val Tyr Gin Tyr 
65 70 75 80 

Glv Asp He Lys Ser Val Glu Asp Lys Gly Glu Tyr Lys Val He Asn 

* . 85 90 9S 

Phe Gly Asn Lys Glu Leu Thr Ala Lys Ala Val He He Ala Thr Gly 

100 105 HO 

Ala Gly Tyr Lys Lys He Gly Val Pro Gly Glu Gin Glu Leu Gly Gly 

115 ** 120 125 

Arg Gly Val Ser Tyr Cys Ala Val Cys Asp Gly Ala Phe Phe Lys Asn 

130 " 135 140 

Lys Arg Leu Phe Val He Gly Gly Gly Asp Ser Ala Val Glu Glu Gly 
145 150 155 160 

Thr Phe Leu Thr Lys Phe Ala Asp Lys Val Thr He Val His Arg Arg 

165 170 175 

Asp Glu Leu Arg Ala Gin Arg He Leu Gin Asp Arg Ala Phe Lys Asn 

180 185 190 

Asp Lys He Asp Phe He Trp Ser His Thr Leu Lys Ser He Asn Glu 
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195 

*J~ m? mm* 










200 








205 










Glv 




Val 


Glv 


Ser 


Val 


Thr 


Leu Thr 


Ser 


Thr 

■mm. 


Lvs 


Astd Glv 




210 










215 








220 








Ser 


Glu 


Glu 


Thr 


His 


Glu 


Ala 


Asp 


Gly Val Phe 


lie 


Tyr 


lie 


Gly Met 


225 










230 








235 








240 


Lys 


Pro 


Leu 


Thr 


Ala 


Pro 


Phe 


Lys 


Asp 


Leu Gly 


He 


Thr 


Asn 


Asp Val 










245 










25 0 








255 


Gly 


Tyr 


lie 


Val 


Thr 


Lys 


Asp 


Asp 


Met 


Thr Thr 


Ser 


Val 


Pro 


Gly lie 








260 










265 








270 




Phe 


Ala 


Ala 


Gly 


Asp 


Val 


Arg Asp 


Lys 


Gly Leu Arg 


Gin 


lie 


Val Thr 






275 










280 








285 






Ala 


Thr 


Gly 


Asp 


Gly 


Ser 


lie 


Ala 


Ala 


Gin Ser 


Thr 


Ser 


Gly 


Tyr lie 




290 










295 








300 








Glu 


His 


Leu 


Asn 


Asp 


Gin 


Ala 

















305 310 



<210> 266 

<211> 326 

<212> PRT 

<213> Streptomyces coelicolor 



<400> 266 



Met 


Ser 


Thr 


Ala 


Lys 


Asp 


Val 


Arg Asp 


Val 


He 


Val 


He 


Gly 


Ser 


Gly 


1 








5 










10 










15 




Pro 


Ala 


Gly 


Tyr 

mm 

20 


Thr 


Ala 


Ala 


Leu 


Tyr 
25 


Thr 


Ala 


Arg 


Ala 


Ser 
30 


Leu 


Asn 


Pro 


Leu 


Val 


Phe 


Gly 


Gly 

mm 


Ala 


He 


Phe 


Val 


Gly Gly 


Ser 


Leu 


Thr 


Thr 






35 










40 










45 








Thr 


Thr 


Glu 


Val 


Glu 


Asn 


Phe 


Pro 


Gly 


Phe 


Pro 


Asp 


Gly Val 


Gin 


Gly 




50 










55 










60 










Pro 


Glu 


Leu 


Met 


Glu 


Asn 


Met 


Arg Ala 


Gin 


Ala 


Glu 


Arg 


Phe 


Gly Ala 


65 










70 










75 










80 


Glu 


Met 


Val 


Asp 


Asp 
85 


Asp 


He 


Val 


Ala 


Val 
90 


Asp 


Leu 


Thr 


Gly 


Asp 
95 


Val 


Lys 


Thr 


Val 


Thr 


Asp 


Thr 


Ala 


Gly Thr Val 


His 


Arg Ala Arg 


Thr 


Val 








100 










105 










110 






He 


Val 


Ala 
115 


Thr 


Gly 


Ser 


Gly 


Tyr 
120 


Arg 


Lys 


Leu 


Gly 


Val 
125 


Pro 


Lys 


Glu 


Asp 


Glu 
130 


Leu 


Ser 


Gly 


Arg 


Gly 
135 


Val 


Ser 


Trp 


Cys 


Ala 
140 


Thr 


Cys 


Asp 


Gly 


Phe 


Phe 


Phe 


Arg 


Asp 


Arg 


Asp 


He 


Val 


Val 


Val 


Gly Gly Gly Asp 


Thr 


145 










150 










155 










160 


Ala 


Met 


Glu 


Glu 


Ala 
165 


Thr 


Phe 


Leu 


Thr 


Arg 
170 


Phe 


Ala 


Arg 


Ser 


Val 
175 


Thr 


Val 


Val 


His 


Arg 


Arg 


Ser 


Ala 


Leu 


Arg 


Ala 


Ser 


Gin 


Val 


Met 


Gin 


Asn 








180 








185 










190 






Arg 


Ala 


Phe 


Ser 


Glu 


Asp 


Lys 


He 


Ser 


Leu 


Ala 


Phe 


Asp 


Ser 


Glu 


Val 




195 










200 










205 








Ala 


Thr 


Leu 


His 


Glu 


Glu 


Asn 


Gly Met 


Leu 


Ser 


Gly 


Met 


Thr 


Leu 


Arg 




210 










215 










220 










Asp 


Thr 


Leu 


Thr 


Gly 


Glu 


Thr 


Arg 


Glu 


Leu 


Ala 


Thr 


Thr Gly 


Leu 


Phe 


225 










230 










235 










240 


He 


Ala 


He 


Gly 


His 


Asp 


Pro 


Arg 


Thr 


Glu 


Leu 


Phe 


Lys 


Gly 


Gin 


Leu 








245 










250 










255 




His 


Leu 


Asp 


Ser 
260 


Glu 


Gly 


Tyr 


Leu 


Met 
265 


Val 


Glu 


Ser 


Pro 


Ser 
270 


Thr 


Arg 


Thr 


Asn 


Val 


Pro 


Gly 


Val 


Phe 


Gly Ala 


Gly 


Asp 


Val 


Val 


Asp 


His 


Thr 






275 








280 










285 








Tyr 


Arg 


Gin 


Ala 


He 


Thr 


Ala 


Ala 


Ser 


Ser 


Gly 


Cys 


Ala 


Ala 


Ala 


Leu 


290 










295 










300 










Asp 


Ala 


Glu 


Arg 


Tyr 


Leu 


Ala 


Ala Arg 


Ser 


Asp 


Thr 


Ser 


Val 


Ser 


Ala 


305 










310 










315 










320 


Glu 


Val 


Val 


Ala 


Val 


Ala 























325 
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<210> 267 
<211> 558 
<212> PRT 

<213> Streptomyces coelicolor 



<400> 267 



Met 


Ala 


Gin 


Ala 


Asp 


Gly 


Glu 


Thr 


Arg 


Thr 


Val 


He 


Met 


Thr 


Val 


Asp 


1 








5 










10 










15 




Asp 


Asp 


Pro 


Gly 


Val 


Ser 


Arg 


Ala 


Val 


Ala 


Arg 


Asp 


Leu 


Arg Arg 


Arg 




20 










25 










30 






Tyr 


Gly 


Ala 


Thr 


Tyr 


Arg 


He 


Val Arg Ala Glu Ser Gly Glu 


Ser 


Ala 


35 




















45 








Leu 


Asp 


Ala 


Leu 


Arg 


Glu 


Leu 


1-1 J' o 


Leu 


Arg 


Gly Asp 


Leu 


Val 


Ala 


Val 




50 








55 










60 










He 


Leu 


Ala 


Asp 


Tyr 


Arg 


Met 




Gin 


Met 


Asn 


Gly 


He 


Glu 


Phe 


Leu 


65 






70 










75 










80 


Glu 


Gin 


Ala 


Leu 


Asp 


Val 


Tyr 


Pro 

C «L- w 


Gly Ala 


Arg 


Arg 


Val 


Leu 


Leu 


Thr 










85 








90 










95 




Ala 


Tyr 


Ala 


Asp 


Thr 


Asn 


Ala 


Ala 


He 


Asp 


Ala 


He 


Asn 


Val 


Val 


Asp 






100 










105 










110 






Leu 


Asp 


His 


Tyr 


Leu 


Leu 


Lys 


Pro 


Trp 


Asp 


Pro 


Pro 


Glu 


Glu 


Lys 


Leu 




115 










120 










125 






His 


Tyr 


Pro 


Val 


Leu 


Asp 


Asp 


Leu 


Leu 


Gin 


Ala 


Trp 


Arg 


Ala 


Gly Asp 


130 










135 










140 










Arg 


Pro 


Val 


Pro 


Ser 


Thr 


Lys 


Val 


Val 


Gly His 


Arg 


Trp 


Ser 


Ala 


Arg 


145 










15 0 








155 










160 


Ser 


Ser 


Glu 


Val 


Arg 


Glu 


Phe 


Leu 


Ala 


Arg 


Asn 


Gin 


Val 


Pro 


Tyr 


Arg 










165 










170 










175 




Trp 


Tyr 


Ser 


Ser 


Asp 


Glu 


Pro 


Glu 


Gly Arg 


Arg 


Leu 


Leu 


Ser 


Ala 


Ala 




180 










185 










190 






Gly 


Gin 


Asp 


Gly 


Gin 


Arg 


Leu 


Pro 


Val 


Val 


He 


Thr 


Pro 


Asp 


Gly 


Thr 




195 










200 










205 








Pro 


Leu 


Val 


Glu 


Pro 


Glu 


Ala 


Pro 


Glu 


Leu 


Ala 


Ala 


Arg 


Val 


Gly 


Leu 




210 










215 










220 










Ala 


Thr 


Thr 


Pro 


Thr 


Ser 


Asp 


Phe 


Tyr 


Aap 


Leu 


Val 


Val 


He 


Gly 


Gly 


225 










230 








235 










240 


Gly 


Pro 


Ala 


Gly 

-* 


Leu 


Gly 


Ala 


Ala 


Val 


Tyr 


Gly Ala 


Ser 


Glu 


Gly 


Leu 






245 










250 










255 




Arg 


Thr 


Val 


Leu 


Val 


Glu 


Arg 


Ser 


Ala 


Thr 


Gly 


Gly 


Gin 


Ala 


Gly 


Gin 






260 










265 










270 






Ser 


Ser 


Arq 


He 


Glu 


Asn 


Tyr 


Leu Gly 


Phe 


Pro 


Asp Gly Val 


Ser 


Gly 






275 










280 










285 








Gly 


Gin 


Leu 


Thr 


Glu 


Arg 

am* 


Ala 


Arg 


Arg 


Gin 


Ala 


Ala 


Arg 


Phe 


Gly 


Ala 


290 










295 










300 










Glu 


He 


Leu 


Thr 


Ala 


Arg 


Glu 


Val 


Thr 


Gly 


Leu 


Glu 


Ala 


Asn 


Gly 


Ala 


305 










310 










315 








His 


320 


Ala 


Arg 


Val 


Val 


Arg 


Phe 


Ser 


Asp 


Gly 


Ser 


Ala 


He 


Ala 


Ala 


Ser 








325 










330 










335 




Val 


He 


Leu 


Ala 


Thr 


Gly 


Val 


Ser 


Tyr 


Arg 


Gin 


Leu 


Thr 


Ala 


Pro 


Gly 








340 










345 










350 






Thr 


Glu 


Asp 


Leu 


Ala 


Gly 


Cys 


Gly Val 


Phe 


Tyr 


Gly 


Ser 


Ala 


Leu 


Thr 






355 










360 










365 








Glu 


Ala 


Ala 


Ser 


Cys 


Gin 


Gly 


His 


Asp 


Val 


Tyr 


He 


Val 


Gly 


Gly 


Ala 




370 








375 










380 










Aan 


Ser 


Ala 


Gly 


Gin 


Ala 


Ala 


Met 


Tyr 


Leu Ala Arg Gly Ala 


Lys 


Ser 


385 








390 










395 










400 


Val 


Thr 


Leu 


Leu 


Val 


Arg 


Gly 


Gly 


Ser 


Leu 


Glu 


Ala 


Ser 


Met 


Ser 


Tyr 










405 






410 










415 




Tyr 


Leu 


He 


Gin 


Gin 


He 


Glu 


Glu 


Thr 


Pro 


Asn 


He 


Arg 


Val 


Arg 


Cys 






420 










425 










430 






Gly 


Thr 


Leu 


Val 


Glu 


Gly 


Ala 


His Gly Asp Gly His 


Leu 


Glu 


Arg 


Leu 




435 










440 










445 








Thr 


Leu 


Arg 


Asp 


Ala 


Ala 


Ser 


Gly Ala 


Thr 


Glu 


Leu 


Val 


Asp 


Ala 


Gin 




450 










455 










460 










Trp 


Leu 


Phe 


Val 


Phe 


He 


Gly 


Ala 


Ala 


Pro 


Leu 


Thr Asp 


Trp 


Leu 


Asp 


465 










470 








475 










480 


Gly 


Thr 


Val 


Leu 


Arg 


Asp 


Glu 


Arg Gly 


Phe 


He 


Leu 


Ala 


Gly 


Pro 


Asp 
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Leu Thr Pro Asp 

500 

Pro Tyr His Leu 
515 

Ala Arg Ala Glu 
530 

Ala Met Ala Val 
545 



485 

Gly Arg Pro Pro 

Glu Thr Ser Val 

520 

Ser Ala Lys Arg 
535 

Met Leu Val His 
550 



490 

Ala Gly Trp Glu 
505 

Pro Gly Val Phe 

Val Ala Ser Ala 

540 

Arg Tyr Leu Glu 
555 



495 

Leu Asp Arg Pro 
510 

Val Ala Gly Asp 

525 

Val Gly Glu Gly 
Gin Ser 



<210> 268 
<211> 303 
<212> PRT 

<213> Streptococcus pneumoniae 

<400> 268 M ^ w _ 

Met Tyr Asp Thr He He He Gly Ala Gly Pro Ala Gly Met Thr Ala 

1 5 10 15 

Ala Leu Tyr Ala Ala Arg Ser Asn Leu Lys Val Ala Leu He Glu Gly 

20 25 30 

Gly Leu Pro Gly Gly Gin Met Asn Asn Thr Ser Asp He Glu Asn Tyr 

35 ~ 40 45 

Pro Gly Tyr Ala Asn He Ser Gly Pro Glu Leu Ala Glu Lys Met Phe 

50 55 60 

Glu Pro Leu Glu Asn Leu Gly Val Glu His He Tyr Gly Tyr Val Glu 
65 70 75 80 

Asn Val Glu Asp His Gly Asp Phe Lys Lys Val Met Thr Asp Asp Gin 

85 90 95 

Thr Tyr Glu Thr Arg Thr Val He Val Ala Thr Gly Ser Lys His Arg 

100 105 HO 

Pro Leu Gly Val Pro Gly Glu Glu Glu Leu Asn Ser Arg Gly Val Ser 

115 120 125 

Tyr Cys Ala Val Cys Asp Gly Ala Phe Phe Arg Asp Gin Asp Leu Leu 

130 ~ 135 140 

Val Val Gly Gly Gly Asp Ser Ala Val Glu Glu Ala Leu Phe Leu Thr 
145 " 150 155 160 

Arq Phe Ala Lys Thr Val Thr He Val His Arg Arg Asp Gin Leu Arg 

165 170 175 

Ala Gin Lys Val Leu Gin Asp Arg Ala Phe Ala Asn Glu Lys He Ser 

180 185 190 

Phe He Trp Asp Ser Val Val Arg Glu He Lys Gly Glu Asn Arg Val 

195 200 205 

Glu Ser Val Val Phe Glu Asn Val Lys Thr Gly Gin Val Thr Glu Gin 

210 215 220 

Ala Phe Gly Gly Val Phe lie Tyr Val Gly Leu Asp Pro Leu Ser Asp 
225 230 235 240 

Phe Val Lys Glu Leu Asn He Gin Asp Gin Ala Gly Trp He Val Thr 

245 250 255 

Asp Asn His Met Lys Thr Ala Val Asp Gly He Phe Ala Val Gly Asp 

260 265 270 

Val Arg Leu Lys Asp Leu Arg Gin Val Thr Thr Ala Val Gly Asp Gly 

275 280 285 

Ala He Ala Gly Gin Glu Ala Tyr Lys Phe He Thr Glu His Ser 
290 " 295 300 



<210> 269 
<211> 330 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 269 

Met Lys Asp Lys Ala Tyr Asp He 

1 5 
Gly Leu Phe Ala Ala Phe Tyr Ala 

20 



Thr He He Gly Gly Gly Pro He 

10 15 
Gly Leu Arg Gly Val Thr Val Lys 
25 ~ ' 3 0 



-160- 



WO 02/50289 



PCT/US0 1/50240 



i i e 


Tl » 
J. -LC3 


^ ? 




T < <=> n 
LXKZ Li 


C e» 
OCX. 


VJ X u 


T i£3l 1 

1_] V-S >wL 

40 


Gly 


Gly 


Gin 


pT~0 


Ala 

■J- * — W ^#«L 

45 


He 


Leu 


Tvr 


XT A. U 




T .~\ro 
Lyb 


I'lC u. 


Tip 
IXC 


Tirr" 


■"■fair' 


He 


Pro 


Ala 


j.y x 


Pro 

X. V— ' 


Ser 


Leu 


Thr 


Gly 




O Lf 








55 










60 










\7zx 1 
Vdl 


LtJLU. 


JjcU 


JL ill 


m ii 

VjjX LL 


7\ en 
./-Vfaii 


Ti^i l 


x x c 


Lys 


Gin 




Ser 




Phe 

X 1 X W- 


Glu 


Asp 


65 










70 










75 










80 


Arg 


X nr 


ml, „ 


lie 


uys 


jueu 


iiys 


n~\ ii 

LalU 


Glu 


Val 


LlC Li 


± ILL 


rue 


Aia LJ 




Val 

V tA A. 








85 










90 










95 




Lys 


Gly 


Gly 


T"lV-» » 


oer 


lie 


Arg 


mr 


Asn 


Lys 


Aid 


m n 

Lai Li 


ill fa 


ir lie 


Cpv 

OCX 


uy fa 


1 U U 










105 










110 






Ala 


lie 


lie 


lie 


Aia 


cys 


fl ir 

Lxiy 


Asn 


Gly Ala 


Pri<=» 
irXlc 


Ala 


prn 

2rX LJ 


At~cj 


A iiA 


IJW A 






tic; 

1 1 D 








X 4a w 










125 








Gly 


Lie u. 


r»i •» i 

LjlU 


C ~v~ 
OCL 


Hi ii 


rxi n 

blU 


Afali 




Ala 


Asp 


His 


Ran 


Leu 


Phe 


Tvr 

A jr -l_ 


Asn 


J.JU 










1 

~1 3 _J 










140 










Vdl 


ill 5 


uin 


J_l tS Li 




m n 


Php 


ni ex 


Glv 


Gin 


Lys 


Val 


Val 


He 


Cys 


Glv 


145 








150 










155 










160 


diy 


Gly 


ASp 


ber 


Ala 

Aia 


i; a 1 

v ai 


ASp 


r T , T*T\ 

irp 


Ala 


Leu 


Ala 


TiP>ii 


Glu 


Ann 


He 


Ala 




165 










170 










175 




tjlU 


C* a 


v ai 


j. n it 
i ft n 

1 o u 


\7n 1 
vdl 


V dl 


All o 


Ai y 


Arg 
185 


Asp 


Ala 


Php 




Ala 
190 


His 


Glu 

V<-f -X- V* 


Hi S 


ber 


Vdl 


bill 


lieu 


T Q 1 1 

lieu 


iiy s 


Ala. 


Ser 


Thr 


Val 


Asn 


Leu 


Tien 


Thr 

X. *>X X- 


Pro 

X. Xi 






1 9 ^ 

X J ZJ 








200 










205 








iyr 


vai 


irro 


iiys 


Aia 


lieu 


T trrl 

iiy fa 


/Til , r 

uiy 


He 


Gly Asn 


Leu 


Ala 


Glu 


Lvs 


Leu 


Tl ft 

z i u 










Z 1 D 










220 










Vai 


Tip 

lie 


f2l Tl 
V?1X1 


ijy fa 


vdl 


uy a 


<Ji Li 


Sen 
fikfa^ 


Glu 


Val 


Leu 


Glu 


Leu 


Glu 


Leu 


Asp 


225 








230 








235 










240 


Ser 


Leu 


lie 


Val 


Ser 
245 


Phe 


Gly 


Phe 


Ser 


Thr 

250 


Ser 


Asn 


Lys 


Asn 


Leu 

255 


Lys 


Asn 


Trp 


Asn 


Leu 


Asp 


Tyr 


Lys 


Arg 


Ser 


Ser 


He 


Thr 


Val 


Ser 


Pro 


Leu 






260 






265 










270 






Phe 


Gin 


Thr 


Ser 


Gin 


Glu 


Gly 


lie 


Phe 


Ala 


He 


Gly Asp 


Ala 


Ala 


Ala 






275 








280 










285 








Tyr 


Asn 


Gly 


Lys 


Val 


Asp 


Leu 


He 


Ala 


Thr Gly 


Phe 


Gly Glu 


Ala 


Pro 


290 










295 










300 










Thr 


Ala 


Val 


Asn 


Gin 


Ala 


lie 


Asn 


Tyr 


He 


Tyr 


Pro 


Asp 


Arg 


Asp 


Asn 


305 










310 








315 










320 


Arg 


Val 


Val 


His 


Ser 


Thr 


Ser 


Leu 


He 


Asp 















325 330 



<210> 270 

<211> 325 

<212> PRT 

<213> Sulfolobus solfataricus 



<400> 270 



Met 


Pro 


Leu 


Lys 


Thr 


Tyr 


Asp 


Thr 


He 


He 


Val 


Gly 


Ala 


Gly 


He 


Ala 


1 






5 






10 










15 




Gly 


Leu 


Ser 


Ala 


Ala 


Leu 


Tyr 


Ser 


Ser 


Arg 


Gin 


Lys 


Leu 


Ser 


Thr 


Leu 






20 










25 










30 






Val 


Leu 


Ser 


Lys 


Asp 


Leu Gly 


Gly 


Gin 


Leu 


Thr 


Leu 


Thr 


Asp 


Leu 


He 






35 








40 










45 








Glu 


Asn 


Tyr 


Pro 


Gly 


He 


Glu 


Ser 


Thr 


Gly Gly 


Leu 


Thr 


Leu 


Ala 


Gin 




50 










55 










60 










Lys 


He 


Glu 


Lys 


Gin 


Ala 


Lys 


Lys 


Phe 


Gly 


Ala 


Glu 


Phe 


He 


Tyr 


Gly 


65 








70 










75 










80 


Glu 


Glu 


Val 


Lys 


Glu 


He 


Ala 


Gin 


Glu 


Ser Asp 


Leu 


Phe 


lie 


He 


Lys 








85 










90 










95 




Gly 


He 


Lys 


Gly 


Glu 


Tyr 


Ala 


Gly 


Arg 


Ala 


Leu 


He 


Leu 


Ala 


Phe 


Gly 






100 










105 










110 






Lys 


Thr 


Pro 


Arg 


Glu 


He 


Asn 


Val 


Pro 


Gly 


Glu 


Gin 


Glu 


Phe 


Lys 


Gly 




115 










120 










125 








Lys 


Gly 


Val 


Ser 


Tyr 


Cys 


Ala 


He 


Cys 


Asp 


Ala 


Ala 


Phe 


Phe 


Lys 


Gly 


130 










135 










140 










Lys 


Pro 


Ala 


Ala 


Val 


He 


Gly 


Glu 


Gly Glu 


Pro 


Gly 


He 


Glu 


Ala 


He 


145 










150 










155 










160 


Glu 


Leu 


Leu 


Ser 


Asn 


Tyr 


Ala 


Asn 


Pro 


Ala 


Tyr 


Tyr 


He 


Thr 


Ser 


Ser 
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165 170 175 



Ser 


Tyr 


Leu 


Ala 


Gly 


Glu 


Glu 


Glu 


He 


Val 


Lys 


Asn Val 


Val 


Asn 


Lys 






180 








185 








190 






Pro 


Thr 


Val 


Lys 


He 


Leu 


Thr 


Ser 


Ser 


Arg 


Val 


Leu Glu 


He 


Arg 


Gly 






195 

^ 








200 








205 




He 




Asn 


Ser 


Lys 


Val 


Glu 


Glu 


Leu 


Val 


lie 


Lys 


Arg 


Gly Asp 


Glu 


Leu 




210 








215 










220 








Gin 


Leu 


Lys 


Val 


Asp 


Gly 


Val 


He 


He 


Glu 


Met 


Gly Tyr 


Thr 


Leu 


Lys 


225 






230 










235 








240 


Thr 


Glu 


Phe 


Leu 


Lye 


Gly 


Phe 


Val 


Glu 


Leu 


Asn 


Glu Lys 


Gly 


Glu 


He 










245 








250 








255 




lie 


Val 


Asp 


Glu 


Leu Gly 


Arg 


Thr 


Ser 


Arg 


Glu 


Gly Val 


Phe 


Ala 


Ala 






260 










265 








270 






Gly 


Asp 


Val 


Thr 


Gin 


Thr 


Pro 


Tyr 


Lys 


Gin 


Ala 


Val Val 


Ala 


Ala 


Ala 


275 










280 








285 








Glu 


Gly 
290 


Val 


Lys 


Ala 


Ala 


Leu 
295 


Ser 


Ala 


Tyr 


Asn 


Tyr He 
300 


Arg 


Ser 


Lys 


Arg 


Gly 


Leu 


Pro 


Pro 


Val 


Thr 


Val 


Asp 


Trp 


Lys 


Ala Glu 


Lys 


Lys 


Lys 


305 








310 










315 








320 


Val 


Ser 


Phe 


Arg 


Leu 























<210> 271 
<211> 323 
<212> PRT 

<213> Sulfolobus solfataricus 



<400> 271 



Met 


Ser 


Leu 


Leu 


Pro 


Arg 


Thr 


Thr 


Ser 


Val 


Lys 


Pro 


Gly 


Glu 


Lys 


Phe 


1 








5 








10 










15 




Asp 


Val 


He 


He 


Val 


Gly Leu Gly 


Pro 


Ala 


Ala 


Tyr Gly 


Ala 


Ala 


Leu 






20 










25 










30 






Tyr 


Ser 


Ala 


Arg 


Tyr 


Met 


Leu 


Lys 


Thr 


Leu 


Val 


He 


Gly 


Glu 


Thr 


Pro 




35 










40 










45 








Gly Gly Gin 


Leu 


Thr 


Glu Ala Gly 


He 


Val 


Asp 


Asp 


Tyr 


Leu Gly 


Leu 




50 










55 










60 






His 


He 


He 


Glu 


He 


Gin Ala 


Ser 


Asp 


Met 


He 


Lys 


Val 


Phe 


Asn 


Lys 


65 










70 








75 










80 


Glu 


Lys 


Tyr 


Glu 


Val 


Pro 


Val 


Leu 


Leu 


Asp 


He 


Val 


Glu 


Lys 


He 


Glu 






85 










90 










95 




Asn Arg 


Gly 


Asp 


Glu 


Phe 


Val 


Val 


Lys 


Thr 


Lys 


Arg 


Lys 


Gly Glu 


Phe 






100 










105 










110 






Lys 


Ala 


Asp 


Ser 


Val 


He 


Leu 


Gly 


He 


Gly 


Val 


Lys 


Arg 


Arg 


Lys 


Leu 




115 










120 










125 








Gly 


Val 


Pro 


Gly 


Glu 


Gin 


Glu 


Phe 


Ala 


Gly 


Arg 


Gly 


He 


Ser 


Tyr 


Cys 


130 










135 










140 








He 


Ser 


Val 


Cys 


Asp 


Ala 


Pro 


Leu 


Phe 


Lys 


Asn 


Arg 


Val 


Val 


Ala 


Val 


145 






150 










155 










160 


Gly Gly Gly 


Asp 


Ser 


Ala 


Leu 


Glu 


Gly Ala 


Glu 


He 


Leu 


Ser 


Ser 


Tyr 










165 










170 










175 




Ser 


Thr 


Lys 


Val 


Tyr 


Leu 


He 


His 


Arg 


Arg 


Asp 


Thr 


Phe 


Lys 


Ala 


Gin 






180 








185 










190 




Val 


Pro 


He 


Tyr 


Val 


Glu 


Thr 


Val 


Lys 


Lys 


Lys 


Pro 


Asn 


Val 


Glu 


Phe 






195 










200 










205 






Gin 


Leu 


Asn 


Ser 


Val 


Val 


Lys 


Glu 


He 


Lys 


Gly Asp 


Lys 


Val 


Val 


Lys 




210 










215 










220 








Val 


Val 


Val 


Val 


Glu 


Asn 


Leu 


Lys 


Thr 


Gly Glu 


He 


Lys 


Glu 


Leu 


Asn 


225 










230 








235 










240 


Asn 


Gly Val 


Phe 


He 


Glu 


He 


Gly 


Phe 


Asp 


Pro 


Pro 


Thr 


Asp 


Phe 


Ala 










245 










250 










255 




Lys 


Ser 


Asn 


Gly 


He 


Glu 


Thr 


Asp 


Thr 


Asn 


Gly 


Tyr 


He 


Lys 


Val 


Asp 






260 










265 










270 






Glu 


Trp 


Met 


Arg 


Thr 


Ser 


Val 


Pro 


Gly 


Val 


Phe 


Ala 


Ala 


Gly Asp 


Cys 




275 










280 










285 








Thr 


Ser 


Ala 


Trp 


Leu 


Gly 


Phe 


Arg 


Gin 


Val 


He 


Thr 


Ala 


Val 


Ala 


Gin 




290 








295 










300 
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Gly Ala Val Ala Ala Thr Ser Ala 
305 310 
Gly Lys Lys 



<210> 272 
<211> 332 
<212> PRT 

<213> Sulf olobus solfataricus 



<400> 272 



Met 


Asp 


Glu 


Tvr 


Asp 


He 


Val 


Val 


1 








5 








Phe 


Glv 


Thr 


Phe 


Tvr 


Ala 


Gly 


Leu 








20 












Ala 


Gin 


Asp 


Glu 


Leu Gly 


Glv 






35 










40 


TjVS 


lie 


Val 


Tyr 


Asp 


Val 


Gly 


Glv 




50 










55 




Leu 

■U S-« SA 


Ala 


Gin 


Arg 


Leu 


He 


Glu 


Gin 


65 








70 






Lvs 


Val 


Asn 


Glu 


Leu 


Ala 


Asp 


Met 


TrD 


He 


Val 


Lys 


85 
Thr 


Asp 


Lys 


Ala 








100 










He 


Ala 


Ala 


Gly 


He 


Gly 


Lys 


He 






115 










120 


Glv 


Glu 


He 


Glu 


Tyr 


Glu 


Asn 


Arg 




130 










135 






i_,ys 




Phe 


Glu 


Gly 


Lys 




145 










150 






Ser 


Ala 


Val 


Asp 


Trp 


Ala 


Leu 


Thr 










165 








Thr 


Leu 


He 


His 


Arg 


Arg 


Asp 


Gin 








180 










Lys 


Glu 


Leu 


Phe 


Arg 


Val 


Ala 


Asn 






195 










200 


Glu 


Val 


Lys 


Gly Asp 


Gly Asn 


Lys 




210 










215 




Asn 


Arg 


Thr 


Lys 


Glu 


Glu 


Lys 


Val 


225 










230 






Ser 


lie 


Gly 


Tyr 


Lys 


Gly Asp 


Leu 










245 








Thr 


Met 


Lys 


Gly Arg 


Asp 


He 


Val 








260 










Leu 


Pro 


Gly 


Val 


Tyr 


Ala 


Gly 


Gly 






275 










280 


Pro 


Lys 


Leu 


Ala 


Leu 


He 


Ala 


Val 




290 










295 




He 


Ser 


Val 


Ala 


Lys 


Lys 


Tyr 


Val 


305 










310 






Gly 


His 


Ser 


Ser 


Glu 


Met 


Asp 


Lys 










325 









<210> 273 
<211> 324 
<212> PRT 

<213> Rhizobium loti 
<400> 273 

Met Thr Thr Lys His Ala Pro Val 

1 5 
Gly Tyr Thr Ala Ala Val Tyr Ala 

20 

Leu Val Ala Gly Leu Gin Gin Gly 



Tyr 


Arg 


Tyr 


Val 


Thr 


Glu 


Lys 


Lys 






315 










320 


He 


Glv 


Glv 


Glv 


Pro 


Val 


Gly 


Leu 




10 










15 




Ar ci 


Asn 


Met 


Lys 


Thr 


Leu 


Leu 


He 


25 










30 






Gin 


Leu 


Val 


Ser 


Leu 


Tvr 


Pro 


Glu 










45 








Leu 


Ala 


Glv 


He 


Gin 


Ala 


Tyr 


Glu 








60 










Ala 


Lys 


Met 


Phe 


Gly 


Pro 


Asp 


He 




75 










80 


He 


Glu 


LVS 


Thr 


Asn 


Asp 


Asn 


Met 




90 










95 




Thr 


Tyr 


Lys 


Thr 


Lys 


Thr 


He 


Phe 


105 










110 






Val 


Pro 


Ser 


Arc? 


Leu 


Gly 


Ala 


Lys 










125 








Gly Val 


Tvr 


Tvr 


Thr 


Val 


Arcr 


Arcr 








140 










Val 


Leu 


He 


Val 


Glv 


Glv 


Gly 


Asp 






155 










160 


Leu 


Ala 


Pro 


Val 


Ala 


Lvs 


Ser 


Val 




170 










175 




Phe 


Arg 


Ala 


His 


Glu 


Arg 


Ser 


Val 


185 










190 






Val 


Tyr 


Val 


Trn 


His 


Glu 


Leu 


Lys 










205 








Val 


Thr 


Gin 


Ala 


He 


He 


Phe 


Asp 








220 










Leu Asp 


Val 


Asp 


Ser 


Val 


He 


He 






235 










240 


Gly Asn 


He 


Pro 


Lys 


Trp 


Gly 


Val 




250 










255 




Val 


Asn 


Gly 


Arg 


Met 


Glu 


Thr 


Asn 


265 








270 






Asp 


He 


Val 


Gin 


Met 


Glu 


Glv 


Ser 










285 








Gly 


Phe 


Ala 


His 


Ala 


Ala 


He 


Ala 






300 










Glu 


Pro 


Asn 


Ala 


Ser 


Leu 


Phe 


Ala 






315 










320 


Phe 


Lys 


Pro 


Lvs 












330 














Leu 


He 


He 


Gly 


Ser 


Gly 


Pro 


Ala 




10 










15 




Ala 


Arg 


Ala 


Met 


Leu 


Lys 


Pro 


Met 


25 








30 






Gly Gin 


Leu 


Met 


He 


Thr 


Thr 


Asp 



-163- 



WO 02/50289 



PCT/US01/50240 







35 










40 


Val 


Glu 


Asn 


Tyr 


Pro 


Gly 


Phe 


Ala 




50 










55 v 




Met 


Glu 


Gin 


Met 


Met 


Lys 


Gin 


Ala 


65 










70 






Asn 


Asp 


lie 


lie 


Thr 
85 


Glu 


Val 


Asp 


Lys 


Gly 


Asp 


Ser 


Gly 


Thr 


Thr 


Tyr 








100 










Thr 


Gly 


Ala 


Gin 


Ala 


Lys 


Trp 


Leu 






115 










120 


Met 


Gly 


Phe 


Gly 

-* 


Val 


Ser 


Ala 


Cys 




13 0 










135 




Arg 


Gly 


Lys 


Asp 


Val 


Ala 


Val 


Val 


145 










150 






Glu 


Ala 


Leu 


Tyr 


Leu 


Ser 


Asn 


Leu 










165 








Arg 


Arg 


Ser 


Asp 


Phe 


Arg 


Ala 


Glu 








180 










Gin 


Lys 


Asp 


Asn 


Val 


Arg 


Val 


lie 






195 










200 


Thr 


Gly 


Arg 


Pro 


Gly 


Lys 


Ala 


Pro 




210 










215 




Lys 


Leu 


Lys 


His 


Ala 


Val 


Thr 


Gly 


225 










230 






Gly 


Val 


Phe 


Val 


Ala 


lie 


Gly His 










245 








Gly 


Lys 


Leu 


Lys 


Gin 


Lys 


Pro 


Asn 








260 










Ser 


Thr 


Arg 


Thr 


Asp 


Val 


Pro 


Gly 






275 










280 


Asp 


Asp 


Val 


Tyr 


Arg 


Gin 


Ala 


Val 




290 










295 




Ala 


Ala 


Leu 


Glu 


Ala 


Glu 


Lys 


Tyr 


305 










310 






Glu 


Ala 


Ala 


Glu 











<210> 274 
<211> 343 
<212> PRT 

<213> Rhizobium loti 



<400> 274 



Met 


Thr 


Gly 


He 


He 


Ser 


Thr Asp 


1 








5 






Val 


Gly 


Leu 


Phe 


Ala 


Val 


Phe Glu 








20 








His 


Leu 


lie 


Asp 


He 


Leu 


Asp Lys 






35 








40 


Tyr 


Pro 


Glu 


Lys 


Pro 


He 


Tyr Asp 




50 










55 


Ala 


Gin 


Gly 


Leu 


Val 


Asp 


Lys Leu 


65 










70 




Pro 


Asp 


Phe 


Thr 


Tyr 


Asn 


Arg Met 










85 






Asp 


Gly 


Ser 


Phe 


Arg 


Val 


Thr Thr 








100 








Lys 


Val 


Val 


Val 


He 


Ala Ala Gly 






115 








120 


Pro 


Pro 


He 


Pro 


Gly 


He 


Glu Pro 




130 










135 


Ser 


Val 


Arg 


Arg 


Met 


Glu 


Asp Phe 


145 










150 




Gly 


Gly 


Gly 


Asp 


Ser 


Ala 


Leu Asp 










165 







Asp 


Pro 


lie 


Gin 


Gly 


Pro 


Trp 


Leu 








60 










iGlu 


His 


Val 


Gly Thr Asp 


He 


He 






75 










80 


Leu 


Asn 


Val 


Arg 


Pro 


Phe 


Arg 


Ala 




90 










95 




Thr 


Ala 


Asp 


Ala 


Leu 


He 


He 


Ala 


105 










110 






Gly 


He 


Pro 


Thr 


Glu 


Gin 


Asp 


Phe 










125 








Ala 


Thr 


Cys 


Asp 


Gly 


Phe 


Phe 


Tyr 








14 0 










Gly 


Gly 


Gly 


Asn 


Ser 


Ala 


Val 


Glu 






155 










160 


Ala 


Lys 


Ser 


Val 


Thr 


Val 


He 


His 




170 










175 




Arg 


He 


Leu 


Arg 


Glu 


Arg 


Leu 


Leu 


185 










190 






Trp 


Asp 


Thr 


Val 


Val 


Asp 


Glu 


He 










205 








Leu 


Pro 


Pro 


Ser 


Val 


Glu 


Gly 


Leu 








220 










Ala 


Glu 


Thr 


His 


Leu 


Lys 


Val 


Asp 






235 










240 


Ala 


Pro 


Ala 


Val 


Glu 


Leu 


Phe 


Val 




250 










255 




Gly 


Tyr 


Leu 


Trp 


Thr 


Ala 


Pro 


Asn 


265 










270 






Val 


Phe 


Ala 


Ala 


Gly Asp 


Val 


Thr 










285 








Thr 


Ala 


Ala 


Gly 


Leu 


Gly 


Cys 


Met 








300 










Leu 


Ala Gly 


He 


Glu 


Val 


His 


Arg 






315 










320 


Val 


Leu 


He 


Val 


Gly Ala 


Gly 


Pro 




10 










15 




Leu 


Gly Leu 


Phe 


Asp 


Met 


Lys 


Cys 


25 










30 






Pro 


Gly Gly Gin 


Cys 


Ala 


Glu 


Leu 










45 








He 


Pro 


Gly 


Trp 


Pro 


Ser 


He 


Ser 








60 










Leu 


Glu 


Gin 


He 


His 


Pro 


Phe 


Lys 






75 










80 


Val 


Ser 


Ser 


Leu 


Glu 


Lys 


Leu 


Glu 




90 










95 




Asp 


Glu 


Asn 


Glu 


Val 


Phe 


Glu 


Ala 


105 










110 






Gly 


Gly Ser 


Phe 


Gin 


Pro 


Lys 


Arg 










125 








Tyr 


Glu Gly 


Lys 


Ser 


Val 


Phe 


Tyr 








140 










Arg 


Gly His 


Asp 


Leu 


Val 


He 


Val 






155 










160 


Trp 


Thr 


Leu 


Asn 


Leu 


Gin 


Pro 


Val 


170 










175' 
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Ala 


Lys 


Ser 


Val 


Thr 


Leu 


Val 


His 


Arg 


Arg 


Pro 


Glu 


Phe 


Arg 


Ala 


Ala 






180 










185 










190 






Pro 


Asp 


Ser 


Val 


Asn 


Lys 


Met 


Tyr 


Ala 


Met 


Gin 


Glu 


Met 


Lys 


Gin 


Leu 




195 










200 










205 








Glu 


Phe 


Arg 


Val 


Gly 


Gin 


Val 


Thr 


Gly 


Leu Thr Gly Ala Asp 


Gly 


Gin 




210 






215 










220 










Leu 


Ser 


Ser 


Ala 


Thr 


He 


Lys 


Gly 


Gly 


Pro 


Asp 


Gly Asp 


He 


Glu 


Val 


225 










230 










235 










240 


Pro 


Cys 


Thr 


Arg 


Met 


Leu 


Pro 


Phe 


Phe 


Gly Leu Thr 


Met 


Lys 


Leu 


Gly 






245 










250 










255 




Pro 


lie 


Ala 


Glu 


Trp 


Gly 


Leu 


Asn 


Leu 


His 


Glu 


Asn 


Leu 


He 


Pro 


Val 








260 






265 










270 






Asp 


Thr 


Glu 


Lys 


Phe 


Gin 


Thr 


Ser. 


Val 


Pro 


Gly 


He 


Phe 


Ala 


Val 


Gly 




275 








280 










285 








Asp 


He 


Asn 


Ser 


Tyr 


Pro 


Gly 


Lys 


Leu 


Lys 


Leu 


lie 


Leu 


Ser 


Gly 


Phe 


290 








295 










300 










His 


Glu 


Val 


Ala 


Leu 


Met 


Ala 


Gin 


Ala 


Ala 


Lys 


Arg 


He 


Val 


Ser 


Pro 


305 










310 










315 










320 


Gly 


Glu 


Arg 


He 


Val 


Phe 


Gin 


Tyr 


Thr 


Thr 


Ser 


Ser 


Thr 


Ser 


Leu 


Gin 






325 








330 










335 




Lys 


Lys 


Leu 


Gly 
340 


Val 


Val 


Gly 





















<210> 275 
<211> 15 
<212> PRT 

<213> Sac char omyces cerevisiae 
<220> 

<221> VARIANT 
<222> 9, 11 

<223> Xaa = Any Amino Acid 
<400> 275 

Val His Asn He Val Thr He He Xaa Ser Xaa Pro Ala Ala His 
15 10 15 



<210> 276 
<211> 104 
<212> PRT 

<213> Staphylococcus aureus 
<400> 276 



Met 


Ala 


He 


Val 


Lys 


Val 


Thr 


Asp 


Ala 


Asp 


Phe 


Asp 


Ser 


Lys 


Val 


Glu 


1 








5 










10 










15 




Ser Gly 


Val 


Gin 


Leu 


Val 


Asp 


Phe 


Trp 


Ala 


Thr 


Trp 


Cys 


Gly 


Pro 


Cys 








20 








25 










30 






Lye 


Met 


He 


Ala 


Pro 


Val 


Leu 


Glu 


Glu 


Leu 


Ala 


Ala 


Asp 


Tyr 


Glu 


Gly 




35 










40 










45 








Lys 


Ala 


Asp 


He 


Leu 


Lys 


Leu 


Asp 


Val 


Asp 


Glu 


Asn 


Pro 


Ser 


Thr 


Ala 


50 










55 










60 










Ala 


Lys 


Tyr 


Glu 


Val 


Met 


Ser 


He 


Pro 


Thr 


Leu 


He 


Val 


Phe 


Lys 


Asp 


65 






70 










75 










80 


Gly 


Gin 


Pro 


Val 


Asp 


Lys 


Val 


Val 


Gly 


Phe 


Gin 


Pro 


Lys 


Glu 


Asn 


Leu 








85 










90 










95 




Ala 


Glu 


Val 


Leu 


Asp 


Lys 


His 


Leu 



















100 



<210> 277 

<211> 92 

<212> PRT 

<213> Staphylococcus xylosus 

<400> 277 
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Met 


Ala 


Glu 


Gin 


Val 


Asp 


Phe 


Asp 


1 
Ala 


Gly 


Met 


Thr 


5 
Ala 


Ala 


Val 


Tyr 






20 










Val 


Met 


lie 


Glu 


Arg 


Gly Met 


Pro 






35 










40 


Glu 


Val 


Glu 


Asn 


Phe 


Pro 


Gly 


Phe 




50 










55 




Sex 


Thr 


Lys 


Met 


Phe 


Glu 


His 


Ala 


65 








70 






Tyr 


Gly 


Asp 


He 


Lys 


Ser 


lie 


Glu 



85 



He Ala He He Gly Ala Gly Pro 

io is 

Ala Ser Arg Ala Asn Leu Ser Thr 
25 ~ 30 

Gly Gly Gin Met Ala Asn Thr Glu 

45 

Glu Met Val Thr Gly Pro Asp Leu 

60 

Lys Lys Phe Gly Ala Lys Tyr Gin 
75 80 

Asp Lys Gly Ser 
90 



<210> 278 
<211> 319 
<212> PRT 

<213> Thermoplasraa 



acidophilum 



<400> 278 
Met Glu Phe Asn 
1 

Phe Asp Val Val 

20 

Val Tyr Ala Ala 
35 

Val Ala Gly Gly 
50 

Gly Phe Lys Ser 
65 

His Ala Ala Asn 

lie Lys Lys Thr 

100 

Tyr His Ala Lys 
115 

Leu Gly Val Lys 
130 

Cys Ser Thr Cys 
145 

He Gly Gly Gly 

Tyr Val Lys Asn 

180 

Glu Asn Ala Tyr 
195 

Met Asn Ala Gin 
210 

Gly Val Lys Tyr 
225 

Thr Asp Gly Val 

Leu Lys Asp Ser 

260 

Asp Ser Arg Gin 
275 

Val Thr Ser Gly 
290 

Cys Lys Ala Ala 
305 



Leu His Ala Val 
5 

He Val Gly Ala 

Arg Ser Gly Phe 

40 

Leu Thr Ala Glu 
55 

He Val Gly Ser 
70 

Tyr Ala Lys He 
85 

Gin Gly Gly Phe 

Tyr Val He He 

12 0 

Gly Glu Ser Glu 
13 5 

Asp Gly Tyr Leu 
150 

Asn Ser Gly Ala 
165 

Val Thr He He 

Val Gin Glu He 

200 

Val Thr Glu He 
215 

Lys Asp Arg Thr 
230 

Phe He Tyr Val 
245 

Gly Val Lys Leu 

Arg Thr Ser Val 

280 

Asn Phe Ala Gin 
295 

Leu Ser Leu Tyr 
310 



Ser Ser Glu Glu 
10 

Gly Ala Ala Gly 
25 

Ser Val Ala He 

Ala Pro Leu Val 

60 

Glu Leu Ala Lys 
75 

Arg Glu Gly Val 
90 

Asp He Glu Thr 
105 

Thr Thr Gly Thr 

Tyr Phe Gly Lys 

14 0 

Phe Lys Gly Lys 
155 

He Ala Ala He 
170 

Glu Tyr Met Pro 
185 

Lys Lys Arg Asn 

Val Gly Asp Gly 

220 

Thr Gly Glu Glu 
235 

Gly Leu He Pro 
250 

Asp Glu Arg Gly 
265 

Pro Gly Val Tyr 

He Ala Ser Ala 

300 

Ser Asp Ser He 
315 



Lys Glu Arg Asp 
15 

Phe Ser Ala Ala 
30 

Leu Asp Lys Ala 
45 

Glu Asn Tyr Leu 

Leu Phe Ala Asp 

80 

Glu Val Arg Ser 
95 

Asn Asp Asp Thr 
110 

Thr His Lys His 
125 

Gly Thr Ser Tyr 

Arg Val Val Thr 

160 

Ser Met Ser Glu 
175 

Lys Tyr Met Cys 
190 

He Pro Tyr He 
205 

Lys Lys Val Thr 

Lys Leu He Glu 

240 

Gin Thr Ser Phe 
255 

Tyr He Val Val 
270 

Ala Ala Gly Asp 
285 

Val Gly Asp Gly 
Ser Lys Lys 



<210> 279 
<211> 317 
<212> PRT 
<213> Thermotoga 



maritima 
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<212> PRT 
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<210> 282 
<211> 318 
<212> PRT 

<213> Vibrio cholerae 
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345 

Tyr Val Ser Leu 

Val Thr Val Met 

380 

Met Ala Glu Arg 
395 

Glu Ala Gly Val 
410 

Glu Lys Ala Gly 
425 

Thr Gly Glu Met 

Ala He Gly Arg 

460 

Gly Val Glu Arg 
475 

Ser Thr Thr lie 
490 

Thr Pro Glu Leu 



Arg Gin Pro Arg 
15 

Ser Asp Thr Leu 
30 

Lys Arg Pro Ala 
45 

Val Asp Ala Pro 

Asn Ser Lys lie 

80 

Leu Val Glu Phe 
95 

Ala Glu Lys Pro 
110 

Arg Ala Ser Gin 
125 

Pro Leu Met Tyr 

Lys Ala Leu Lys 

160 

Tyr Asp, Leu He 
175 

Lys Glu Ala Ser 
190 

Lys Pro Ser Pro 
205 

Asn Val Gly Cys 

Gly His Ser He 

240 

Gly Lys Val Glu 

2 55 

His He Ala Ser 
270 

Thr Val Thr Tyr 
285 

He Ser Ala Thr 

Arg Phe Leu He 

32 0 

Gly Val Lys Glu 

3 35 

Tyr Ser Pro Gly 
350 

Glu Cys Ala Gly 
365 

Val Arg Ser He 

He Arg Lys His 

400 

Pro Thr Arg He 
415 

Lys Tyr Arg Val 
430 

Gin Glu Val Ser 
445 

Glu Ala Val Thr 

Ala Lys Ser Lys 

480 

Pro Trp Val Tyr 
495 

Thr Pro Val Ala 
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He 


Gin 


Ala 


Gly 


Arg 


Val 


Leu 


Met 


Arg Arg 


He 


Phe 


Asp 


Gly Ala Asn 






515 










520 








525 








Glu 


Leu 


Thr 


Glu 


Tyr 


Asp 


Gin 


He 


Pro Thr 


Thr 


Val 


Phe 


Thr 


Pro 


Leu 




530 








535 








540 










Glu 


Tvr 


Gly 


Cys 


Cys 


Gly 


Leu 


Ser 


Glu Glu Asp 


Ala 


Met 


Met 


Lys 


Tyr 


545 










550 








555 










560 


Gly 


Lys 


Asp 


Asn 


He 


He 


He 


Tyr 


His Asn 


Val 


Phe 


Asn 


Pro 


Leu 


Glu 










565 








570 










575 




Tyr 


Thr 


He 


Ser 


Glu 


Arg 


Met 


Asp 


Lys Asp 


His 


Cys 


Tyr 


Leu 


Lys 


Met 






580 










585 








590 






He 


Cys 


Leu 


Arg 


Asn 


Glu 


Glu 


Glu 


Lys Val 


Val 


Gly 


Phe 


His 


He 


Leu 




595 








600 








605 








Thr 


Pro 


Asn 


Ala 


Gly 


Glu 


Val 


Thr 


Gin Gly 


Phe 


Gly 


He 


Ala 


Leu 


Lys 




610 








615 








620 










Leu 


Ala 


Ala 


Lys 


Lys 


Ala 


Asp 


Phe 


Asp Arg 


Leu 


He 


Gly 


He 


His 


Pro 


625 






630 








635 










640 


Thr 


Val 


Ala 


Glu 


Asn 


Phe 


Thr 


Thr 


Leu Thr 


Leu 


Glu 


"Lys 


Lys 


Glu 


Gly 










645 








650 










655 




Asp 


Glu 


Glu 


Leu 


Gin 


Ala 


Ser 


Gly 


Cys Xaa Gly 













660 665 



<210> 289 
<211> 516 
<212> PRT : 

<213> Drosophila melanogaster 



<400> 289 



Met 


Ser 


Thr 


He 


Lys 


Phe 


Leu 


Arg 


Ser 


Ser 


Thr 


His 


Asn 


Ala 


Leu 


Arg 


1 








5 










10 










15 




Ser 


Ser 


Leu 


Gly 
20 


Trp 


Cys 


Arg 


Leu 


Ala 
25 


Ala 


Ser 


Arg 


Pro 


Arg 
30 


Tyr 


Asp 


Tyr 


Asp 


Leu 


Val 


Val 


Leu 


Gly 


Gly 


Gly 


Ser 


Ala 


Gly 


Leu 


Ala 


Cys 


Ala 


35 










40 










45 








Lys 


Glu 


Ala 


Ala 


Gly 


Cys 


Gly 


Ala 


Arg 


Val 


Leu 


Cys 


Phe 


Asp 


Tyr 


Val 


50 










55 










60 










Lys 


Pro 


Thr 


Pro 


Val 


Gly 


Thr 


Lys 


Trp 


Gly 


He 


Gly 


Gly 


Thr 


Cys 


Val 


65 










70 










75 










80 


Asn 


Val 


Gly 


Cys 


He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ser 


Leu 


Leu 






85 










90 










95 




Gly 


Glu 


Ala 


Val 


His 


Glu 


Ala 


Val 


Ala Tyr Gly 


Trp 


Asn 


Val 


Asp 


Asp 






100 










105 










110 






Thr 


Asn 


He 


Arg 


Pro 


Asp 


Trp 


Arg 


Lys 


Leu 


Val 


Arg 


Ser 


Val 


Gin 


Asn 






115 








120 










125 








His 


He 


Lys 


Ser 


Val 


Asn 


Trp 


Val 


Thr 


Arg 


Val 


Asp 


Leu 


Arg 


Asp 


Lys 




130 








135 










14 0 






His 




Lys 


Val 


Glu 


Tyr 


Val 


Asn 


Ser 


Met 


Ala 


Thr 


Phe 


Arg 


Asp 


Ser 


Thr 


145 








150 










155 










160 


He 


Glu 


Tyr 


Val 


Ala 


Met 


Pro 


Gly 


Ala 


Glu 


His 


Arg 


Gin 


Val 


Thr 


Ser 








165 










170 










175 




Glu 


Tyr 


Val 


Val 


Val 


Ala 


Val 


Gly 


Gly 


Arg 


Pro 


Arg 


Tyr 


Pro 


Asp 


He 






180 










185 










190 






Pro 


Gly 


Ala 


Val 


Glu 


Leu 


Gly 


He 


Thr 


Ser 


Asp 


Asp 


He 


Phe 


Ser 


Tyr 




195 










200 










205 








Glu 


Arg 


Glu 


Pro 


Gly 


Arg 


Thr 


Leu 


Val 


Val 


Gly 


Ala 


Gly 


Tyr 


Val 


Gly 




210 








215 










220 










Leu 


Glu 


Cys 


Ala 


Cys 


Phe 


Leu 


Lys 


Gly 


Leu 


Gly 


Tyr 


Glu 


Pro 


Thr 


Val 


225 






230 










235 










240 


Met 


Val 


Arg 


Ser 


He 


Val 


Leu 


Arg 


Gly 


Phe 


Asp 


Arg 


Gin. Met 


Ser 


Glu 








245 










250 










255 




Leu 


Leu 


Ala 


Ala 


Met 


Met 


Thr 


Glu 


Arg 


Gly 


He 


Pro 


Phe 


Leu Gly 


Thr 








260 










265 










270 






Thr 


He 


Pro 


Lys 


Ala 


Val 


Glu 


Arg 


Gin 


Ala 


Asp 


Gly Arg 


Leu 


Leu 


Val 






275 








280 










285 








Arg 


Tyr 


Arg 


Asn 


Thr 


Thr 


Thr 


Gin 


Met 


Asp 


Gly 


Ser Asp 


Val 


Phe 


Asp 


290 








295 










300 
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Thr 


Val 


LiRll 




Ala 


He 


Glv 


Arcr 


Lvs 


Glv 


Leu 


He 


Glu 


Asp 


Leu 


Asn 


305 










310 










315 










320 


Leu 

*mr KJL 


Agn 


Ala 


Ala 


Gly Val 


LVS 


Thr 


His 


Asp 


Asp 


Lys 


He 


Val 


Val 


Asp 










325 










330 










335 




Ala 


Ala 


Glu 


Ala 
340 


Thr 


Ser 


Val 


Pro 


His 
345 


He 


Phe 


Ala 


Val 


Gly 
350 


Asp 


He 


lie 


Tvr 


Gly 
355 


Arq 


Pro 


Glu 


Leu 


Thr 
360 


Pro 


Val 


Ala 


He 


Leu 
365 


Ser 


Gly 


Arg 


Leu 


Leu 

370 


Ala 


Arq 


Arg 


Leu 


Phe 
375 


Ala 


Gly 


Ser 


Thr 


Gin 
380 


Leu 


Met 


Asp 


Tyr 


Ala 


Asp 


Val 


Ala 


Thr 


Thr 


Val 


Phe 


Thr 


Pro 


Leu 


Glu 


Tyr 


Ser 


Cys 


Val 


3 85 








390 










395 










400 


Glv 


Met 


Ser 

krf V-^ 


Glu 


Glu 


Thr 


Ala 


He 


Glu 


Leu 


Arg Gly Ala Asp 


Asn 


He 








405 










410 










415 




Glu 


Val 


Phe 


His 


Gly Tyr 


Tvr 


Lvs 


Pro 


Thr 


Glu 


Phe 


Phe 


He 


Pro 


Gin 








420 










425 










430 






TiVR 


W ^ -L_ 


Val 
435 




His 


Cys 




Leu 
440 


Lvs 


Ala 


Val 


Ala 


Glu 
445 


Val 


Ser 


Gly 


Asp 


Gin 
450 


Lys 


He 


Leu 


Gly 


Leu 
455 


His 


Tyr 


He 


Gly 


Pro 
460 


Val 


Ala 


Gly 


Glu 


Val 


lie 


Gin 


Gly 


Phe 


Ala 


Ala 


Ala 


Leu 


Lys 


Thr 


Gly 


Leu 


Thr 


Val 


Lys 


465 










470 










475 










480 


Thr 


Leu 


Leu 


Asn 


Thr 


Val 


Gly 


He 


His 


Pro 


Thr 


Thr 


Ala 


Glu 


Glu 


Phe 










485 








490 










495 




Thr 


Arg 


Leu 


Ser 


He 


Thr 


Lys 


Arg 


Ser 


Gly 


Arg Asp 


Pro 


Thr 


Pro 


Ala 






500 










505 










510 






Ser 


Cys 


Cys 
515 


Ser 



























<210> 290 
<211> 5-24 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> 523 

<2 23> Xaa = Any Amino Acid 



<400> 290 



Met 


Ala 


Ala 


Met 


Ala 


Val 


Ala 


Leu 


Arg 


Gly 


Leu Gly 


Gly Arg 


Phe 


Arg 


1 








5 










10 








15 




Trp 


Arg 


Thr 


Gin 


Ala 


Val 


Ala 


Gly 


Gly 


Val 


Arg Gly 


Ala 


Ala 


Arg 


Gly 








20 










25 








30 






Ala 


Ala 


Ala 


Gly 


Gin 


Arg 


Asp 


Tyr 


Asp 


Leu 


Leu Val 


Val 


Gly 


Gly 


Gly 






35 










40 








45 








Ser 


Gly 


Gly 


Leu 


Ala 


Cys 


Ala 


Lys 


Glu 


Ala 


Ala Gin 


Leu 


Gly Arg 


Lys 




50 










55 








60 










Val 


Ser 


Val 


Val 


Asp 


Tyr 


Val 


Glu 


Pro 


Ser 


Pro Gin Gly 


Thr 


Arg 


Trp 


65 










70 










75 








80 


Gly 


Leu 


Gly 


Gly 


Thr 


Cys 


Val 


Asn 


Val 


Gly 


Cys He 


Pro 


Lys 


Lys 


Leu 










85 










90 








95 




Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu 


Gly Gly 


Leu 


He Gin 


Asp 


Ala 


Pro 


Asn 








100 










105 








110 






Tyr 


Gly 


Trp 


Glu 


Val 


Ala 


Gin 


Pro 


Val 


Pro 


His Asp 


Trp 


Arg 


Lys 


Met 




115 










12 0 








125 








Ala 


Glu 


Ala 


Val 


Gin 


Asn 


His 


Val 


Lys 


Ser 


Leu Asn 


Trp 


Gly 


His 


Arg 




130 










135 








140 










Val 


Gin 


Leu 


Gin Asp Arg 


Lys 


Val 


Lys 


Tyr 


Phe Asn 


He 


Lys 


Ala 


Ser 


145 










150 










155 








160 


Phe 


Val 


Asp 


Glu 


His 


Thr 


Val 


Cys 


Gly 


Val 


Ala Lys 


Gly Gly 


Lys 


Glu 










165 










170 








175 




He 


Leu 


Leu 


Ser 


Ala 


Asp 


His 


He 


He 


He 


Ala Thr 


Gly 


Gly 


Arg 


Pro 








180 








185 








190 






Arg 


Tyr 


Pro 


Thr 


His 


He 


Glu 


Gly Ala 


Leu 


Glu Tyr 


Gly 


He 


Thr 


Ser 


195 










200 








205 
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Asp Asp lie Pine 
210 

Gly Ala Ser Tyr 
225 

Gly Leu Asp Thr 

Asp Gin Gin Met 

260 

Thr Arg Phe Leu 
275 

Asp Gly Gin Leu 
290 

Asp Thr Gly Thr 
305 

Asp Thr Arg Ser 

Asp Thr Gin Lys 

340 

His lie Tyr Ala 
355 

Pro lie Ala lie 
370 

Gly Ser Ser Asp 
385 

Thr Pro Leu Glu 

Ala Arg His Gly 

420 

Pro Leu Glu Phe 
435 

Lys Met Val Cys 
450 

Phe Leu Gly Pro 
465 

lie Lys Cys Gly 

His Pro Thr Cys 

500 

Ser Gly Leu Asp 
515 



Trp Leu Lys Glu 
215 

Val Ala Leu Glu 
230 

Thr lie Met Met 
245 

Ser Ser Met Val 

Arg Gly Cys Ala 

2 80 

Gin Val Thr Trp 
295 

Phe Asp Thr Val 
310 

Leu Asn Leu Glu 
325 

lie Leu Val Asp 

lie Gly Asp Val 

360 

Met Ala Gly Arg 
375 

Leu Met Asp Tyr 
390 

Tyr Gly Cys Val 
405 

Gin Glu His Val 

Thr Val Ala Gly 

440 

Leu Arg Glu Pro 
455 

Asn Ala Gly Glu 
470 

Ala Ser Tyr Ala 
485 

Ser Glu Glu Val 

Pro Thr Val Thr 

520 



Ser Pro Gly Lys 

220 

Cys Ala Gly Phe 
235 

Arg Ser lie Pro 
250 

lie Glu His Met 
265 

Pro Ser Arg Val 

Glu Asp Ser Thr 

300 

Leu Trp Ala lie 
315 

Lys Ala Gly Val 
330 

Ser Arg Glu Ala 
345 

Val Glu Gly Arg 

Leu Leu Val Gin 

380 

Asp Asn Val Pro 
3 95 

Gly Leu Ser Glu 
410 

Glu Val Tyr His 
425 

Arg Asp Ala Ser 

Pro Gin Leu Val 

460 

Val Thr Gin Gly 
475 

Gin Val Met Arg 
490 

Val Lys Leu Arg 
505 

Gly Cys Xaa Gly 



Thr Leu Val Val 

Leu Thr Gly lie 

240 

Leu Arg Gly Phe 
255 

Ala Ser His Gly 
270 

Arg Arg Leu Pro 
2 85 

Thr Gly Lys Glu 

Gly Arg Val Pro 

320 

Asp Thr Ser Pro 
335 

Thr Ser Val Pro 
350 

Pro Glu Leu Thr 
365 

Arg Leu Phe Gly 

Thr Thr Val Phe 

400 

Glu Glu Ala Val 
415 

Ala His Tyr Lye 
430 

Gin Cys Tyr Val 
445 

Leu Gly Leu His 

Phe Ala Leu Gly 

480 

Thr Val Gly lie 
495 

lie Ser Lys Arg 
510 



<210> 291 
<211> 497 
<212> PRT 

<213> Homo sapiens 



<400> 291 



Met 


Asn 


Gly 


Pro 


Glu 


1 








5 


lie 


lie 


Gly 


Gly 


Gly 








20 




Gin 


Tyr 


Gly 


Lys 


Lys 






35 






Leu 


Gly 


Thr 


Arg 


Trp 




50 








lie 


Pro 


Lys 


Lys 


Leu 


65 










Gin 


Asp 


Ser 


Arg 


Asn 










85 


Asp 


Trp 


Asp 


Arg 


Met 








100 




Asn 


Trp 


Gly 


Tyr 


Arg 






115 






Asn 


Ala 


Tyr 


Gly 


Gin 




130 








Asn 


Lys 


Gly 


Lys 


Glu 



Asp Leu Pro Lys Ser Tyr 

10 

Ser Gly Gly Leu Ala Ala 

25 

Val Met Val Leu Asp Phe 
40 

Gly Leu Gly Gly Thr Cys 
55 

Met His Gin Ala Ala Leu 
70 75 
Tyr Gly Trp Lys Val Glu 

90 

lie Glu Ala Val Gin Asn 

105 

Val Ala Leu Arg Glu Lys 
120 

Phe lie Gly Pro His Arg 

135 

Iiys He Tyr Ser Ala Glu 



Asp Tyr Asp Leu He 

15 

Ala Lys Glu Ala Ala 
30 

Val Thr Pro Thr Pro 
45 

Val Asn Val Gly Cys 
60 

Leu Gly Gin Ala Leu 

80 

Glu Thr Val Lys His 

95 

His He Gly Ser Leu 
110 

Lys Val Val Tyr Glu 
125 

He Lys Ala Thr Asn 
140 

Arg Phe Leu He Ala 
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145 










150 










155 










160 


1 111. 


V31 jr 


Glu 


Arg 


Pro 


±\±. y 


lyr 




uiy 


116 


Jrro 


Gly Asp 


ify s 


m 1 1 

Lai U 


lyr 










165 










170 










175 






Tie 

x x c 


Ser 


Ser 
180 


Asp 




T i 

ucr L*. 


IT IXC 


0 tr X 

1 H5 

X O J 


T.aii 
XJC? LI 


ir X CJ 


lyr 


Lyo 


1 9fl 

J. «7 U 


Laiy 


iiy s 


Thr 


TiPi i 


Val 
195 


Val 


Gly 


Al a 


tJ X 


- 1 - y x 

200 

M V 


Va 1 

v a. x 


Al a 


xtc: ix 


L3X IX 


Ly 0 
205 

w J 


Al 3 

/IX Cl 


R1 \r 
uiy 


xr lit; 


X) G KJ. 


AT a 

.rtX ct 

210 

u pL V 


Gly 


He 


Gly 


Lie LL 


215 

ij X J 


V Gil 


. llll 


Vdl 


i v lc L, 


\7zi 1 

v ai 

& £t V 


Aig 


C7 ei 


Tl 0 
11c 


lie U 


T ,Q 1 1 




Gly Phe Asp 


Lzlil 


M.Sp 




Aid 




iiy s 


Tl 0 

lie 


Gly 


VjlU 


m S 


Mel- 


ooc 










o n 










/ J J 










9 a n 
z *± u 


uXU 


vjlU 


nx a 


Gly 


He 
245 


Lys 


rile 


ixe 


Arg 


250 


Pne 


vai 


Pro 


He 


Lys 
255 


vai 


bill 


bin 


Tl 
lie 


Glu 
260 


Ala 


(-11 


inr 


Fro 


ni,, 
(j±y 

265 


Arg 


ijeu 


Arg 


Val 


Val 
270 


Ala 

Aia 


Lrin 


oer 


np Vy v» 


21 en 


Ser 


Glu 


LjIU 


i ±e 


lie 




tsxy 


bill 


xyr 


Asn 


Thr 


vai 


McL 






275 










9 ft n 

(j o u 






285 








T i 

uc u. 


Al a 


lie 


Gly Arg 


nop 


A 1 -j 
rtld 


Lys 


1X1X7 


/ixy 


jjys 


He 


Gly 


Leu 


ril n 


TV* t" 
1 111 




290 

J w 










OQC 

<j _7 J 










300 










V CX X 


C?l \r 
<crxy 


Val 


Lys 


He 


1\ cn 






X 111 


Gly 


Lys 


He 


Pro 


Val 


TVl "¥~ 
1X1X7 


>L»Dp 


305 










310 










315 










320 


Glu 

U X LA. 


Glu 

L3 X u. 


Gin 


Thr 


Asn 


\7^1 
val 


irLU 


Ayr 


Tip 

lie 


Tyr 


Ala 


He 


Gly Asp 


Tl f=» 
lie 


T • Al 1 

lieu 










325 










330 










335 




Ol Li 


A en 


Lys 


Val 
340 


Glu 


XJGL1 




riO 


Val 
345 


Ala 


He 


Gin 


Ala 


Gly 
350 


Miry 


jjeu 


T ,oi i 
JJCLl 


/i± ci 


Gin 
355 


Arg 


Leu 


lyr 


H.X Si 


Liiy 

J u u 


Ser 


Thr 


Val 


Lys 


Cys 
365 


Asp 


Tyr 


r 1 ! 11 


Anil 


V cil 


Pro 


Thr 


Thr 


Vdl 


rile 


rpL -y- 

inr 


Pro 


Leu 


Glu 


Tyr 


Gly Ala 


LyS 


uxy 




•170 










J / o 










380 










UC LI 


CpT 
OCI 


Glu 


Glu 


Lys 




V ctX 


ulU 


Lys 


Phe Gly Glu Glu Asia 


Tl 0 

lie 


m n 

blU 


3 85 

*J W _i 










3 90 










395 










it y w 


v ei x 


A yr 


His 


Ser 


Tyr 
405 




1 x p 


IrX LJ 


Leu 


Glu 
410 


Trp 


Thr 


He 


Pro 


OCX 

415 


Ti ~y~* — f 

Aig 


Asp 


Asn 


Asn 


Lys 
420 


Cys 


Tyr 


Ala 


Lys 


He 
425 


He 


Cys 


Asn 


Thr 


Lys 
430 


Asp 


Asn 


Glu 


Arg 


Val 


Val 


Gly 


Phe 


His 


Val 


Leu Gly 


Pro 


Asn 


Ala 


Gly 


Glu 


Val 






435 










440 










445 








Thr 


Gin 


Gly Phe Ala 


Ala 


Ala 


Leu 


Lys 


Cys 


Gly 


Leu 


Thr 


Lys 


Lys 


Gin 




450 










455 










460 










Leu 


Asp 


Ser 


Thr 


He 


Gly 


He 


His 


Pro 


Val 


Cys 


Ala 


Glu 


Val 


Phe 


Thr 


465 










470 










475 










480 


Thr 


Leu 


Ser 


Val 


Thr 


Lys 


Arg 


Ser 


Gly Ala Arg 


He 


Leu 


Gin 


Ala 


Gly 










485 










490 










495 




Cys 

































<210> 292 
<211> 497 
<212> PRT 
<213> Homo sapien 



<400> 292 



Met 


Asn 


Gly 


Pro 


Glu 


Asp 


Leu 


Pro 


Lys Ser 


Tyr 


Asp 


Tyr 


Asp 


Leu 


He 


1 








5 








10 










15 




He 


He 


Gly 


Gly 
20 


Gly 


Ser 


Gly 


Gly 


Leu Ala 
25 


Ala 


Ala 


Lys 


Glu 
30 


Pro 


Ala 


Gin 


Tyr 


Gly 
35 


Lys 


Lys 


Val 


Met 


Val 
40 


Leu Asp 


Phe 


Gly 


Thr 
45 


Pro 


Thr 


Pro 


Leu 


Gly 
50 


Thr 


Arg 


Trp 


Gly 


Leu 
55 


Gly 


Gly Thr 


Cys 


Val 
60 


Asn 


Val 


Gly 


Cys 


He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala Ala 


Leu 


Leu Gly 


Gin 


Ala 


Leu 


65 










70 








75 










80 


Gin 


Asp 


Ser 


Arg 


Asn 
85 


Tyr 


Gly 


Trp 


Lys Val 
90 


Glu 


Glu 


Thr 


Val 


Lys 
95 


His 


Asp 


Trp 


Asp 


Arg 
100 


Met 


He 


Glu 


Ala 


Val Gin 
105 


Asn 


His 


He 


Gly 
110 


Ser 


Leu 
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As 11 


Trr> 


Glv 


Tvr 






115 




Asn 


Ala 


Tvr 


Glv 




130 






Asn 


TjVfl 

XJJr U 


Glv 


Lvs 


145 








Thr 


Gly 


Glu 


Arg 


Cvs 


lie 


Ser 

kd> Vrf* w*^ 


Ser 








180 


Thr 


Leu 


Val 


Val 






195 




Lieu 


Ala 


Glv 


He 




210 










Glv 


Phe 


225 








Glu 


Glu 


His 


Gly 


Glu 

\J X 


Gin 


He 

*Jh V** 


Glu 

^ 








260 


Ser 


Thr 


Asn 


Ser 






275 




Leu 


Ala 


He 


Gly 




290 






Val 


Gly 


Val 


Lys 


305 








Glu 


Glu 


Gin 


Thr 


Gl n 


ZV <3T* 


Lys 


Val 








340 






Gin Arg 






355 




Asn 


Val 


Pro 


Thr 




370 






Leu 


Ser 


Glu 


Glu 


385 








Val 


Tyr 


His 


Ser 


Asp 


Asn 


Asn 


Lys 








420 


Glu 


Arg 


Val 


Val 






435 




Thr 


Gin 


Gly 


Phe 




450 






Leu 


Asp 


Ser 


Thr 


4S5 








Thr 


Leu 


Ser 


Val 


Cys 









Arg Val Ala Leu 

120 

Gin Phe He Gly 
135 

Glu Lys He Tyr 
150 

Pro Arg Tyr Leu 
165 

Asp Asp Leu Phe 

Gly Ala Ser Tyr 

200 

Gly Leu Asp Val 
215 

Asp Gin Asp Met 
230 

He Lys Phe He 
245 

Ala Gly Thr Pro 

Glu Glu He He 

280 

Arg Asp Ala Cys 
295 

He Asn Glu Lys 
310 

Asn Val Pro Tyr 
325 

Glu Leu Thr Pro 

Leu Tyr Ala Gly 

360 

Thr Val Phe Thr 
375 

Lys Ala Val Glu 
390 

Tyr Phe Trp Pro 
405 

Cys Tyr Ala Lys 

Gly Phe His Val 

440 

Ala Ala Ala Leu 
455 

He Gly He His 
470 

Thr Lys Arg Ser 
485 



Arg Glu Lys Lys 

Pro His Arg He 

14 0 

Ser Ala Glu Arg 
155 

Gly He Pro Gly 
170 

Ser Leu Pro Tyr 
185 

Val Ala Leu Glu 

Thr Val Met Val 

220 

Ala Asn Lys He 
235 

Arg Gin Phe Val 
250 

Gly Arg Leu Arg 
265 

Glu Gly Glu Tyr 

Thr Arg Lys He 

300 

Thr Gly Lys He 
315 

He Tyr Ala He 
330 

Val Ala He Gin 
345 

Ser Thr Val Lys 

Pro Leu Glu Tyr 

380 

Lys Phe Gly Glu 
3 95 

Leu Glu Trp Thr 
410 

He He Cys Asn 
425 

Leu Gly Pro Asn 

Lys Cys Gly Leu 

460 

Pro Val Cys Ala 
475 

Gly Ala Ser He 
490 



Val Val Tyr Glu 
125 

Lys Ala Thr Asn 

Phe Leu He Ala 

160 

Asp Lys Glu Tyr 
17 5 

Cys Pro Gly Lys 
190 

Cys Ala Gly Phe 
205 

Arg Ser He Leu 

Gly Glu His Met 

240 

Pro He Lys Val 
255 

Val Val Ala Gin 
270 

Asn Thr Val Met 
285 

Gly Leu Glu Thr 

Pro Val Thr Asp 

320 

Gly Asp He Leu 
335 

Ala Gly Arg Leu 
350 

Cys Asp Tyr Glu 
365 

Gly Ala Cys Gly 

Glu Asn He Glu 

400 

He Pro Ser Arg 
415 

Thr Lys Asp Asn 
430 

Ala Gly Glu Val 
445 

Thr Lys Lys Gin 

Glu Val Phe Thr 

480 

Leu Gin Ala Gly 
495 



<210> 293 

<211> 521 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> VARIANT 
<222> 520 

<223> Xaa = Any Amino Acid 
<400> 293 

Met Ala Val Ala Leu Arg Gly Leu 

1 5 
Gin Ala Val Ala Gly Gly Val Arg 

20 



Gly Gly Arg Phe Arg Trp Arg Thr 

10 15 
Gly Ala Ala Arg Gly Ala Ala Ala 
25 30 
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Gly 


Gin 


Arg Asp 


Tyr 


Asp 


Leu 


Leu 


Val 


Val 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 






35 










40 










45 








Leu 


Ala 


Cys 


Ala 


Lys 


Glu 


Ala 


Ala 


Gin 


Leu 


Gly Arg Lys 


Val 


Ala 


Val 




50 










55 










60 










Val 


Asp 

XT 


Tyr 


val 


Glu 


Pro 


Ser 


Pro 


Gin 


Gly 


Thr 


Arg 


Trp 


Gly 


Leu 


Gly 


65 










70 










75 










80 


Gly 


Thr 


Cys 


Val 


Asn 


Val 


Gly Cys 


He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 










85 










90 










95 




Ala 


Ala 


Leu 


Leu 


Gly Gly 


Leu 


He 


Gin 


Asp 


Ala 


Pro 


Asn 


Tyr 


Gly 


Trp 








100 










105 










110 






Glu 


Val 


Ala 


Gin 


Pro 


Val 


Pro 


His 


Asp 


Trp 


Arg 


Lys 


Met 


Ala 


Glu 


Ala 






115 










120 










125 








Val 


Gin 


Asn 


His 


Val 


Lys 


Ser 


Leu 


Asn 


Trp 

IT 


Gly 


His 


Arq 


Val 


Gin 


Leu 




130 










135 










140 










Gin 


Asp 


Arg 


Lys 


Val 


Lys 


Tyr 


Phe 


Asn 


He 


Lys 


Ala 


Ser 


Phe 


Val 


Asp 


145 










150 










155 










160 


Glu 


His 


Thr 


Val 


Cys 


Gly Val 


Ala 


Lys 


Gly 


Gly 


Lys 


Glu 


He 


Leu 


Leu 










165 










170 










175 




Ser 


Ala 


Asp 


His 


He 


He 


He 


Ala 


Thr 


Gly 
•* 


Gly Arg 


Pro 


Arg 


Tyr 


Pro 








180 










185 










190 






Thr 


His 


He 


Glu 


Gly Ala 


Leu 


Glu 


Tyr 


Gly 


He 


Thr 


Ser 


Asp 


Asp 


He 






195 










200 










205 








Phe 


Trp 


Leu 


Lys 


Glu 


Ser 


Pro 


Gly 


Lys 


Thr 


Leu 


Val 


Val 


Gly 


Ala 


Ser 




210 










215 










220 










Tyr 


Val 


Ala 


Leu 


Glu 


Cys 


Ala Gly 


Phe 


Leu 


Thr 


Gly 


He 


Gly 


Leu 


Asp 


225 










230 










235 










240 


Thr 


Thr 


He 


Met 


Met 


Arg 


Ser 


lie 


Pro 


Leu 


Arq 


Gly 


Phe 


Asp 


Gin 


Gin 










245 










250 










255 




Met 


Ser 


Ser 


Met 


Val 


He 


Glu 


His 


Met 


Ala 


Ser 


His 


Gly 


Thr 


Arg 


Phe 








260 










265 








270 






Leu 


Arg 


Gly 


Cys 


Ala 


Pro 


Ser 


Arg 


Val 


Arg 


Arq 


Leu 


Pro 


Asp 


Gly 


Gin 






275 










280 










285 








Leu 


Gin 


Val 


Thr 


Trp 


Glu 


Asp 


Ser 


Thr 


Thr 


Gly 


Lys 


Glu 


Asp 


Thr 


Gly 




290 










295 










300 










Thr 


Phe 


Asp 


Thr 


Val 


Leu 


Trp 


Ala 


He 


Gly 


Arg 


Val 


Pro 


Asp 


Thr 


Arg 


305 










310 










315 










320 


Ser 


Leu 


Asn 


Leu 


Glu 


Lys 


Ala Gly Val 


Asp 


Thr 


Ser 


Pro 


Asp 


Thr 


Gin 




• 






325 










330 










335 




Lys 


He 


Leu 


Val 


Asp 


Ser 


Arg 


Glu 


Ala 


Thr 


Ser 


Val 


Pro 


His 


He 


Tyr 








340 










345 










350 






Ala 


He 


Gly Asp 


Val 


Val 


Glu Gly Arg 


Pro 


Glu 


Leu 


Thr 


Pro 


He 


Ala 






355 










360 










365 








He 


Met 


Ala 


Gly 


Arg 


Leu 


Leu 


Val 


Gin 


Arq 


Leu 


Phe 


Gly Gly 


Ser 


Ser 




370 










375 










380 










Asp 


Leu 


Met 


Asp 


Tyr 


Asp 


Asn 


Val 


Pro 


Thr 


Thr 


Val 


Phe 


Thr 


Pro 


Leu 


385 






390 










395 










400 


Glu 


Tyr 


Gly 


Cys 


Val 


Gly 


Leu 


Ser 


Glu 


Glu 


Glu 


Ala 


Val 


Ala 


Arg 


His 










405 










410 










415 




Gly 


Gin 


Glu 


His 


Val 


Glu 


Val 


Tyr 


His 


Ala 


His 


Tyr 


Lys 


Pro 


Leu 


Glu 








420 










425 










430 






Phe 


Thr 


Val 


Ala 


Gly Arg 


Asp 


Ala 


Ser 


Gin 


Cys 


Tyr 


Val 


Lys 


Met 


Val 






435 










440 










445 








Cys 


Leu 


Arg 


Glu 


Pro 


Pro 


Gin 


Leu 


Val 


Leu 


Gly 


Leu 


His 


Phe 


Leu 


Gly 


450 










455 








460 










Pro 


Asn 


Ala Gly 


Glu 


Val 


Thr 


Gin 


Gly 


Phe 


Ala 


Leu 


Gly 


He 


Lys 


Cys 


465 










470 










475 










480 


Gly 


Ala 


Ser 


Tyr 


Ala 


Gin 


Val 


Met 


Arg 


Thr 


Val 


Gly 


He 


His 


Pro 


Thr 










485 










490 










495 




Cys 


Ser 


Glu 


Glu 


Val 


Val 


Lys 


Leu 


Arg 


He 


Ser 


Lys 


Arg 


Ser 


Gly 


Leu 








500 










505 










510 






Asp 


Pro 


Thr 


Val 


Thr 


Gly 


Cys 


Xaa Gly 




















515 










520 



















<210> 294 
<211> 579 
<212> PRT 
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<213> Homo sapiens 



<220> 

<221> VARIANT 
<222> 578 

<223> Xaa = Any Amino Acid 



<400> 294 
Ala Glu Arg Val 
1 

Arg Val Lys Glu 

20 

Glu Leu Asp Gin 
35 

Glu lie Thr Asn 
50 

His Val Gly Gly 
65 

Leu Gin Lys Leu 

lie lie Gly Gly 

100 

He Leu Gly Lys 
115 

Gin Gly Thr Ser 
130 

He Pro Lys Lys 
145 

Cys Asp Ser Arg 

Asn Trp Glu Thr 

180 

Asn Trp Gly Tyr 
195 

Asn Ser Tyr Gly 
210 

Lys Lys Gly Gin 
225 

Thr Gly Glu Arg 

Cys He Thr Ser 

260 

Thr Leu Val Val 
275 

Leu Ala Gly Phe 
290 

Leu Arg Gly Phe 
3 05 

Glu Gin His Gly 

Gin Gin Leu Glu 

340 

Ser Thr Glu Gly 
355 

Leu Ala He Gly 
370 

He Gly Val Lys 
3B5 

Val Glu Gin Thr 

Glu Asp Lys Pro 

420 

Leu Ala Gin Arg 
435 

Asn Val Pro Thr 
450 

Leu Ser Glu Glu 



Val He Phe Ser 
5 

Leu Phe Ser Ser 

Val Asp Asp Gly 

40 

Gin Lys Thr Val 
55 

Cys Asp Gin Thr 
70 

Leu Gin Glu Asp 
85 

Gly Ser Gly Gly 

Lys Val Met Val 

120 

Trp Gly Leu Gly 
135 

Leu Met His Gin 
150 

Lys Phe Gly Trp 
165 

Met Thr Lys Ala 

Arg Leu Ser Leu 

200 

Glu Phe Val Glu 
215 

Glu Thr Tyr Tyr 
230 

Pro Arg Tyr Leu 
245 

Asp Asp Leu Phe 

Gly Ala Ser Tyr 

280 

Gly Leu Asp Val 
295 

Asp Gin Glu Met 
310 

Val Lys Phe Leu 
325 

Lys Gly Ser Pro 

Thr Glu Thr He 

360 

Arg Asp Ser Cys 
375 

He Asn Glu Lys 
390 

Asn Val Pro Tyr 
405 

Glu Leu Thr Pro 

■ 

Leu Phe Gly Ala 

440 

Thr Val Phe Thr 
455 

Lys Ala He Glu 



Lys Ser Tyr Cys 
10 

Leu Gly Val Glu 
25 

Ala Arg Val Gin 

Pro Asn He Phe 

60 

Phe Gin Ala Tyr 
75 

Leu Ala Tyr Asp 
90 

Leu Ser Cys Ala 
105 

Leu Asp Phe Val 

Gly Thr Cys Val 

140 

Ala Ala Leu Leu 
155 

Glu Tyr Asn Gin 
170 

He Gin Asn His 
185 

Arg Glu Lys Ala 

His His Lys He 

220 

Thr Ala Ala Gin 
235 

Gly He Gin Gly 
250 

Ser Leu Pro Tyr 
265 

Val Ala Leu Glu 

Thr Val Met Val 

300 

Ala Glu Lys Val 
315 

Arg Lys Phe He 
330 

Gly Lys Leu Lys 
345 

Glu Gly Val Tyr 

Thr Arg Lys He 

3 80 

Ser Gly Lys He 
395 

Val Tyr Ala Val 
410 

Val Ala He Gin 
425 

Ser Leu Glu Lys 

Pro Leu Glu Tyr 

460 

Val Tyr Lys Lys 



Pro His Ser Thr 
15 

Cys Asn Val Leu 
30 

Glu Val Leu Ser 
45 

Val Asn Lys Val 

Gin Ser Gly Leu 

80 

Tyr Asp Leu He 
95 

Lys Glu Ala Ala 
110 

Val Pro Ser Pro 
125 

Asn Val Gly Cys 

Gly Gin Ala Leu 

160 

Gin Val Arg His 
175 

He Ser Ser Leu 
190 

Val Ala Tyr Val 
2 05 

Lys Ala Thr Asn 

Phe Val He Ala 

240 

Asp Lys Glu Tyr 
255 

Cys Pro Gly Lys 
270 

Cys Ala Gly Phe 
285 

Arg Ser He Leu 

Gly Ser Tyr Met 

32 0 

Pro Val Met Val 
335 

Val Leu Ala Lys 
350 

Asn Thr Val Leu 
365 

Gly Leu Glu Lys 

Pro Val Asn Asp 

400 

Gly Asp He Leu 
415 

Ser Gly Lys Leu 
430 

Cys Asp Tyr He 
445 

Gly Cys Cys Gly 
Glu Asn Leu Glu 
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465 








lie 


Tyr 


His 


Thr 


Glu 


Asn 


Asn 


Thr 








500 


Asp 


Arg 


Val 


He 






515 




Thr 


Gin 


Gly 


Phe 




530 






Leu 


Asp 


Asp 


Thr 


545 








Thr 


Leu 


Glu 


He 


Cys 


Xaa 


Gly 





470 

Leu Phe Trp Pro 
485 

Cys Tyr Ala Lys 

Gly Phe His He 

520 

Ala Ala Ala Met 
535 

He Gly He His 
550 

Thr Lys Ser Ser 
565 



475 

Leu Glu Trp Thr 
490 

He He Cys Asn 
505 

Leu Gly Pro Asn 

Lys Cys Gly Leu 

54 0 

Pro Thr Cys Gly 
555 

Gly Leu Asp He 
570 



480 

Val Ala Gly Arg 
495 

Lys Phe Asp His 
510 

Ala Gly Glu Val 
525 

Thr Lys Gin Leu 

Glu Val Phe Thr 

560 

Thr Gin Lys Gly 
575 



<210> 295 
<211> 524 
<212> PRT 
<213> Homo sapien 



<220> 

<221> VARIANT 
<222> 523 

<223> Xaa = Any Amino Acid 



<400> 295 
Met Ala Ala Met 
1 

Trp Arg Thr Gin 

20 

Ala Ala Ala Gly 

35 

Ser Gly Gly Leu 
50 

Val Ala Val Val 
65 

Gly Leu Gly Gly 

Met His Gin Ala 

100 

Tyr Gly Trp Glu 
115 

Ala Glu Ala Val 

130 

Val Gin Leu Gin 
145 

Phe Val Asp Glu 

He Leu Leu Ser 

180 

Arg Tyr Pro Thr 
195 

Asp Asp He Phe 
210 

Gly Ala Ser Tyr 
225 

Gly Leu Asp Thr 

Asp Gin Gin Met 

26 0 

Thr Arg Phe Leu 
275 

Asp Gly Gin Leu 
290 

Asp Thr Gly Thr 



Ala Val Ala Leu 
5 

Ala Val Ala Gly 

Gin Arg Asp Tyr 

40 

Ala Cys Ala Lys 
55 

Asp Tyr Val Glu 
70 

Thr Cys Val Asn 
85 

Ala Leu Leu Gly 

Val Ala Gin Pro 

120 

Gin Asn His Val 
135 

Asp Arg Lys Val 
150 

His Thr Val Cys 
165 

Ala Asp His He 

His He Glu Gly 

200 

Trp Leu Lys Glu 
215 

Val Ala Leu Glu 
230 

Thr He Met Met 
245 

Ser Ser Met Val 

Arg Gly Cys Ala 

280 

Gin Val Thr Trp 
295 

Phe Asp Thr Val 



Arg Gly Leu Gly 
10 

Gly Val Arg Gly 
25 

Asp Leu Leu Val 

Glu Ala Ala Gin 

60 

Pro Ser Pro Gin 
75 

Val Gly Cys He 
90 

Gly Leu He Gin 
105 

Val Pro His Asp 

Lys Ser Leu Asn 

140 

Lys Tyr Phe Asn 
155 

Gly Val Ala Lys 
170 

He He Ala Thr 
185 

Ala Leu Glu Tyr 

Ser Pro Gly Lys 

220 

Cys Ala Gly Phe 
235 

Arg Ser He Pro 
250 

He Glu His Met 
265 

Pro Ser Arg Val 

Glu Asp Ser Thr 

300 

Leu Trp Ala He 



Gly Arg Phe Arg 
15 

Ala Ala Arg Gly 
30 

Val Gly Gly Gly 
45 

Leu Gly Arg Lys 

Gly Thr Arg Trp 

80 

Pro Lys Lys Leu 
95 

Asp Ala Pro Asn 
110 

Trp Arg Lys Met 
125 

Trp Gly His Arg 

He Lys Ala Ser 

160 

Gly Gly Lys Glu 
175 

Gly Gly Arg Pro 
190 

Gly He Thr Ser 
205 

Thr. Leu Val Val 

Leu Thr Gly He 

240 

Leu Arg Gly Phe 
255 

Ala Ser His Gly 
270 

Arg Arg Leu Pro 
2 85 

Thr Gly Lys Glu 
Gly Arg Val Pro 
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305 










310 










i iii 


/*xg 




T i^M 1 

325 


Asn 


Leu 


Glu 


uys 


Asp 


1 111 


bin 


uys 
340 


Tip 
IXC 


Leu 


Val 


Asp 


cj «=• >- 

OCX 

345 


nib 


Tip 

lie 


xyr 


7\ "1 -j 
riJ-d 


Tip 


Gly Asp 


Val 


V dX 






355 










360 




rX U 


lie 


nl d 


11c 


Met 


Ala 


Gly Arg 


JJC Li. 




370 










375 






Gl v 






TV nn 


Leu 


Met 


Asp 


Tyr 


Art") 


385 










390 








X 111. 


r 1 U 




Glu 


Tyr 
405 


Gly 


Cys 


Val 


rzl v 

vjx jr 


7\ 1 ■= 


rti. y 




Gly 
420 


Gin 


Glu 


His 


Val 


Glu 
425 


f 37 O 


JjcU 


Pi ., 
blu 

435 


Phe 


Thr 


Val 


Ala 


Gly 
440 




Lys 


Met 


Val 


Cys 


Leu Arg 


Glu 


Pro 


Pro 




450 










455 






Phe 


Leu 


Gly 


Pro 


Asn 


Ala 


Gly Glu 


Val 


465 










470 








lie 


Lya 


Cys 


Gly Ala 


Ser 


Tyr 


Ala 


Gin 










485 










His 


Pro 


Thr 


Cys 
500 


Ser 


Glu 


Glu 


Val 


Val 
505 


Ser 


Gly 


Leu 
515 


Asp 


Pro 


Thr 


Val 


Thr 
520 


Gly 



<210> 296 
<211> 577 
<212> PRT 
<213> Homo sapien 

<220> 

<221> VARIANT 
<222> 576 

<223> Xaa Any Amino Acid 



<400> 296 



Arg 
1 


Val 


Val 


He 


Phe 
5 


Ser 


Lys 


Ser 


Tyr 


Lys 


Glu 


Leu 


Phe 


^Ser 


Ser 


Leu Gly 


Val 






20 










25 


Asp 


Gin 


Val 


Asp 


Asp 


Gly 


Ala 


Arg 


Val 






35 










40 




Thr 


Asn 


Gin 


Lys 


Thr 


Val 


Pro 


Asn 


He 




50 










55 






Gly 


Gly 


Cys 


Asp 


Gin 


Thr 


Phe 


Gin 


Ala 


65 










70 








Lys 


Leu 


Leu 


Gin 


Glu 
85 


Asp 


Leu 


Ala 


Tyr 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 


Leu 


Ser 


Cys 








100 










105 


Gly 


Lys 


Lys 


Val 


Met 


Val 


Leu 


Asp 


Phe 






115 










120 




Thr 


Ser 


Trp 


Gly 


Leu 


Gly 


Gly 


Thr 


Cys 




130 










135 






Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


145 










150 








Ser 


Arg 


Lys 


Phe 


Gly 


Trp 


Glu 


Tyr 


Asn 






165 










Glu 


Thr 


Met 


Thr 


Lys 


Ala 


lie 


Gin 


Asn 








180 










185 


Gly 


Tyr 


Arg 


Leu 


Ser 


Leu 


Arg 


Glu 


Lys 






195 










200 




Tyr 


Gly 


Glu 


Phe 


Val 


Glu 


His 


His 


Lys 
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315 










320 


rtJ. CL 


Gly Val 




Thr 


Set* 




330 










335 




«iy 


Glu 


Ala 


1111 


O C JL. 


val 


Pro 










350 






V3X 11 


Gly Arg 


XZ A— \~J 


Glu 


T if 3 ! 1 
<UC LA 


Thr 








~j \j —> 








u t. u. 


Val 




£i i . y 


-LJ >-* 




Glv 






3 80 










7\ Tl 
ribil 


Val 


Pro 


Thr 


Thr- 


Val 

V CX J» 


Phe 




395 










400 


T iPi i 

J— Jv^ LA 


Ser 


Glu 


Glu 


Glu 


Ala 


Val 


410 










415 




Val 


Tyr 


His 


Ala 


His 


Tvr 

JL 


Lvs 










430 






Asp 


Ala 


Ser 


Gin 


Cys 


Tyr 


Val 








445 








Gin 


Leu 


Val 


Leu 


Gly 


Leu 


His 






460 










Thr 


Gin 


Gly 


Phe 


Ala 


Leu 


Gly 




475 










480 


Val 


Met 


Arg 


Thr 


Val 


Gly 


He 


490 










495 




Lys 


Leu 


Arg 


He 


Ser 


Lys 


Arg 










510 






Cys 


Xaa 


Gly 











Cys 


Pro 


His 


Ser 


Thr 


Arg 


Val 


10 










15 




Glu 


Cys 


Asn 


Val 


Leu 


Glu 


Leu 








30 






Gin 


Glu 


Val 


Leu 


Ser 


Glu 


He 








45 








Phe 


Val 


Asn 


Lys 


Val 


His 


Val 






60 










Tyr 


Gin 


Ser 


Gly 


Leu 


Leu 


Gin 


75 










80 


Asp 


Tyr 


Asp 


Leu 


He 


He 


lie 


90 










95 




Ala 


Lys 


Glu 


Ala 


Ala 


He 


Leu 








110 






Val 


Val 


Pro 


Ser 


Pro 


Gin 


Gly 








125 








Val 


Asn 


Val 


Gly 


Cys 


He 


Pro 






140 










Leu 


Gly 


Gin 


Ala 


Leu 


Cys 


Asp 




155 










160 


Gin 


Gin 


Val 


Arg 


His 


Asn 


Trp 


170 










175 




His 


He 


Ser 


Ser 


Leu 


Asn 


Trp 










190 






Ala 


Val 


Ala 


Tyr 


Val 


Asn 


Ser 








205 








He 


Lys 


Ala 


Thr 


Asn 


Lys 


Lys 
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210 










215 




Gly 


Gin 


Glu 

w J— u 


Thr 


Tvr 

xy x. 


TV 27 

X jr X. 


Thr 


Ala 


225 










230 






Glu 


At* ri 


Pro 

X. -1— V— ' 


JT1_L. V-j 


Tvr 

X- y x 


Leu 

J — 1 v — ■ *»X 


Glv 


lie 










245 








Th.ir 

X ilJ. 








Leu 


X 4*V 


Ser 


Leu 

XJ Vw- 








260 










Val 

V CX i 


Val 


Glv 


Ala 


Ser 

w JW 


Tvr 


Val 


Ala 






275 

X»J / ' 










280 


o -i- y 




Gl v 






Val 

V d JL 


Thr 

JL J> ii- 


Val 

V CX ml* 




290 










295 




ox y 


XT -IXC 


B cn 


GT n 


Glu 


1 1C L. 


Ala 


Glu 


3 05 

^ V/ 










310 






His 


Glv 

u x. y 


Val 


44Y 0 


Phe 


ux 


Ara 


Lvs 

xjy o 










325 








4—1 LX 


Glu 

nJ -X. l_X 


TjVS 


Glv 

v_j x y 


Ser 


Pro 


Glv 


Lvs 








340 










Glu 

VJJ. VX 


Glv 
"j- y 


Thr 


Glu 

%_f iJL *JL 


Thr 

x ix 


lie 

.X. .X. 


Glu 

V_J «JL» ^X 


Glv 






3 55 
*j ~j -*j 










360 

— > v_> \/ 


Tie 


Gl v 


jmx. y 




S^r 


v- y S3 


X XXX. 


Ara 
"j. y 




370 










375 




Val 




lie 




Glu 

VJ JL tX 


Xjy D 


Cpr 


Glv 












3 90 






Gl n 


Thr 

X i A JL 


Ben 


Val 


JT JL w 


x y jl 


Val 

V cx «x> 


Tvr 
xy x. 










405 








xjy o 


Prn 

IT X. W 


Glu 


T t 

J-J" ix 


Thr 

X. X IX 


Pro 
tr jl v/ 


Val 

v X, 


Ala 






420 










Gin 

V_7 -J- -I -I 


**xy 


Leu 


Phe 


Glv 


Ala 

xt ■ J ■ 


Ser 


XJ ^— wX 






435 










440 




Thr 

X, XX X. 


Thr 


Val 


Phe 


X- *1X 


Pro 


Leu 




450 










455 




Glu 


Glu 


Uys 


Ala 


lie 


Glu 


Val 


Tvr 


465 










470 






Hie 


Thr 


Leu 


Phe 


Trn 

X X* kJ 


Pro 


Leu 


Glu 

\_J -X u 










485 








Asn 


Thr 


Cys 


Tyr 


Ala 


Lys 


lie 


He 








500 










Val 


lie 


Gly Phe 


His 


lie 


Leu 


Gly 






515 










j 4& \j 


Gly 


Phe 


Ala 


Ala 


nla 


Met 


Lys 






530 










535 




Asp 


Thr 


lie 


Gly 


lie 


His 


Pro 


Thr 


545 










550 






Glu 


lie 


Thr 


Lys 


Ser 


Ser Gly 












565 








Gly 
















<210> 297 












<211> 494 












<212> PRT 












<213> Homo j 


sapien 








<400> 297 












Met 


Glu 


Asp 


Gin 


Ala 


Gly Gin Arg 


1 








5 








Gly Gly 


Ser 


Gly 


Gly 


Leu 


Ala 


Cys 








20 










Arg 


Lys 


Val 


Ala 


Val 


Val 


Asp 


Tyr 






35 










40 


Arg 


Trp 


Gly 


Leu 


Gly 


Gly Thr Cys 




50 










55 




Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


65 










70 






Pro 


Asn 


Tyr 


Gly 


Trp 


Glu 


Val 


Ala 










85 









220 



Ala 


Gin 


Phe 


Val 


He 


Ala 


Thr 


Gly 






235 










240 


Gin Gly 


Asp 


Lys 

ml 


Glu 


Tyr 


Cys 


He 




250 










255 




Pro 


Tvr 

jl jr jw 


Cys 


Pro 


Gly 

mA 


Lys 


Pro 


Leu 


265 










270 






Leu 


Glu 


Cys 


Ala 


Gly 


Phe 


Leu 


Ala 








285 








Met 


Val 


Arg 


Ser 


He 


Leu 


Leu 


Arg 








300 










Lvs 


Val 


Gly 


Ser 


Tyr 


Met 


Glu 


Gin 






315 










320 


Phe 


He 


Pro 


Val 


Met 


Val 


Gin 


Gin 




330 










335 




Leu 


Lvs 


Val 


Leu 


Ala 


Lys 


Ser 


Thr 


345 








350 






Val 

V U X 


Tvr 

xy x 


Asn 


Thr 


Val 


Leu 


Leu 


Ala 








365 








Lvs 

xj jr vj 


He 

X X ^» 


Gly 


Leu 


Glu 


Lys 

mi 


He 


Gly 








380 










Lvs 

XJ F I— ' 


He 


Pro 


Val 


Asn 


Asp 


Val 


Glu 






395 










400 


Ala 


Val 


Gly 


Asp 

X7 


He 


Leu 


Glu 


Asp 




410 










415 




He 


Gin 


Ser 


Gly 


Lys 


Leu 


Leu 


Ala 


425 










430 






Glu 

VJX lx 


Lvs 


Cys 


Asp 

XT 


Tyr 


He 


Asn 


Val 










445 








Glu 


xy x 


Gly 

JT 


Cys 


Cys 


Gly 


Leu 


Ser 








460 










Lys 


Lys 


Glu 


Asn 


Leu 


Glu 


He 


Tyr 






475 










480 


Trp 


Thr 


Val 


Ala 


Gly Arg 


Glu 


Asn 




490 










495 




Cys 


Asn 


Lys 


Phe 


Asp 


His 


Asp 


Arg 


505 










510 






Pro 


Asn 


Ala 


Gly 


Glu 


Val 


Thr 


Gin 










525 








Gly 


Leu 


Thr 


Lys 


Gin 


Leu 


Leu 


Asp 








540 










Cys 


Gly 


Glu 


Val 


Phe 


Thr 


Thr 


Leu 


555 










560 


Asp 


He 


Thr 


Gin 


Lys 


Gly 


Cys 


Xaa 




570 










575 





Asp 


Tyr 


Asp 


Leu 


Leu 


Val 


Val 


Gly 




10 










15 




Ala 


Lys 


Glu 


Ala 


Ala 


Gin 


Leu 


Gly 


25 










30 






Val 


Glu 


Pro 


Ser 


Pro 


Gin 


Gly 


Thr 










45 








Val 


Asn 


Val 


Gly 


Cys 


He 


Pro 


Lys 








60 










Leu 


Gly 


Gly 


Leu 


He 


Gin 


Asp 


Ala 






75 










80 


Gin 


Pro 


Val 


Pro 


His 


Asp 


Trp 


Arg 




90 










95 
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Mpf 


Al a 


\J3 JL U 


Ala 

jl cx 


Val 


Gin 


Asn 








100 










Hi «? 




V GLJL 


Gin 


Leu 


Gin Asp 


Arg 




115 










120 


Ala 


Qoy- 

tJ ^ JL 


Phe 


Val 

V Cx JL. 


Arc 


Glu 


His 


Thr 




130 










135 




TiVa 

J— IV O 


Glu 


lie 

-L- J. <w 


j-jvi 


Leu 


Ser 


Ala Asp 


145 










150 








Drn 

IT -£- \J 


A*r*a 
y 


J-Jfx 


PjTO 

JL il. iw/ 


Thr 


His 


lie 










165 








Thr 

X 1 1JL 


lJ c J. 


A RTi 




lie 

JL JL w 


Phe 


Trp 


Leu 








180 










Vs» 1 

v (XX 


Va 1 

V «X 


«iy 


Al a 


uCl 


Tyr 


Val 


Ala 






195 










200 




Tl f» 
lie 


vj J. _y 


Lie UL 




Thr 


Thr 


lie 




^ JL w 










215 




vJX_y 


JrJlCS 


7\ on 


ulU 


r;~| n 


Met 


Ser 


Ser 


225 

^L* O -J 










230 






XIX to 


f31 vr 
oiy 


1 II J. 




x I1C 


Leu Arg Gly 










245 








■Lie: u 


Pvo 


A em 


Glv 


Gin 

VJ J- XX 


Leu 


Gin 


Val 








260 










T.vc" 

xiy& 


Glu 




Thir 


Glv 


Thr 


Phe 


Asp 






275 










280 


V CLJL, 


JT JL w 


A Fin 


JL llli 


**J- y 


Ser 


Leu 


Asn 




290 








295 




Gov 
O t: 1_ 


Prn 

i 1U 


A an 


i in. 


Gin 

VJI.il 


Lys 


lie 


Leu 


305 










310 






vax 


rio 


XI X s 


Tip 

lie 


lyx 


Ala 


lie 


Gly 










325 








JjSU 


T 1 Vi ■*» 


tr I U 


T 1 Vi 
X J1X 


Al a 


lie 


Met 


Ala 








J T VJ 










Jriltr 


Ljxy 


nl v 

vji _y 


CJ a -K" 


O 

o e J- 


Asp 


Leu 


Met 






355 










360 


v ex. _i_ 


nic 


Thr 

JL 1 IX 


py-n 

IT J— L-/ 


Ti<=»n 

-LJ L. 1-1 


Glu 


Tyr 


Gly 




370 










375 




Al a 


V d JL 


Al a 


jt-v-l, y 


His 

X X «JW 0 


Gly Gin 


Glu 


385 










390 






Tyr 


Lys 


Pro 


Leu 


Glu 


Phe 


Thr 


Val 






405 








Tyr 


Val 


Lys 


Met 


Val 


Cys 


Leu 


Arg 








420 










Leu 


His 


Phe 


Leu 


Gly 


Pro 


Asn 


Ala 






435 










440 


Leu 


Gly 


lie 


Lys 


Cys 


Gly Ala 


Ser 




450 










455 




Gly 


lie 


His 


Pro 


Thr 


Cys 


Ser 


Glu 


465 










470 






Lys 


Arg 


Ser 


Gly 


Leu 


Asp 


Pro 


Thr 



485 



His 

4-1 ml* W 


Val 

V CX -1- 


J 


Ser 


Leu 


Asn 


Trp 

■j»- 


Gly 


105 










110 






Lvs 


Val 


Lvs 


Tyr 


Phe 


Asn 


He 


Lvs 










125 








Val 


Cvs 


Glv 


Val 


Ala 


Lys 


Glv 


Gly 

J; 








140 










His 


lie 


lie 


lie 


Ala 


Thr 


Gly 

JT 


Gly 






155 










160 


Glu 

u 


Glv 

J; 


Ala 


Leu 


Glu 


Tyr 


Gly 


He 




170 










175 




Lys 


Glu 


Ser 


Pro 


Gly 


Lys 


Thr 


Leu 


185 










190 






JJC LI 


Glu 




Ala Gly 


Phe 


Leu 


Thr 










205 








Met 


Met 


AjTQ 

J*i JL. 


Ser. 


lie 


Pro 


Leu 


Ara 








220 










Met* 


Val 


lie 

JU -JU v*- 


Glu 


His 


Met 


Ala 


Ser 






235 










240 


Pvs 


Ala 


Pro 

Jk- 


Ser 


Arg 


Val 


Arg 


Arg 




250 










255 




Thr 


Trp 


Glu 


Asp 


Ser 


Thr 


Thr 


Gly 

JT 


265 










270 






Thr 


Val 


Leu 


Trp 


Ala 


He 


Gly 


Arg 










285 








Leu 


Glu 


Lvs 


Ala 


Gly 


Val 


Asp 


Thr 








300 










Val 


Asp 


Ser 

L^ «lv 


Arg 


Glu 


Ala 


Thr 


Ser 






315 










320 


Asp 


Val 


Val 

V Ol JL 


Glu Gly Arg 


Pro 

X_ J— 


Glu 


330 










335 




Gly Arg 


Tien 


Leu 


Val 


Gin 


Ara 

**X» 


Leu 


345 










350 






Asp 


Tyr 


A qr-i 


Asn 


Val 


Pro 


Thr 


Thr 








365 








Cys 


Val 


Gly 


Leu 


Ser 


Glu 


Glu 


Glu 






380 










His 


Val 


Glu 


Val 


Tyr 


His 


Ala 


His 






395 










400 


Ala 


Gly 


Arg 


Asp 


Ala 


Ser 


Gin 


Cys 




410 










415 




Glu 


Pro 


Pro 


Gin 


Leu 


Val 


Leu 


Gly 


425 










430 






Gly Glu 


Val 


Thr Gin Gly 


Phe 


Ala 










445 








Tyr 


Ala 


Gin 


Val 


Met 


Arg 


Thr 


Val 






460 










Glu 


Val 


Val 


Lys 


Leu 


Arg 


He 


Ser 






475 








480 


Val 


Thr 


Gly 


Cys 


Cys 


Gly 







490 



<210> 298 
<211> 521 
<212> PRT 
<213> Homo sapien 



<400> 298 



Met 


Ala 


Ala 


Met 


Ala 


Val 


Ala 


Leu 


Arg 


Gly 


Leu 


Gly Gly Arg 


Phe 


Arg 


1 








5 










10 






15 




Trp 


Arg 


Thr 


Gin 


Ala 


Val Ala Gly Gly Val 


Arg 


Gly Ala Ala 


Arg 


Gly 




20 










25 






30 






Ala 


Ala 


Gly 


Gin Arg 


Asp 


Tyr 


Asp 


Leu 


Leu 


Val 


Val Gly Gly Gly 


Ser 






35 










40 








45 






Gly 


Gly 


Leu 


Ala 


Cys 


Ala 


Lys 


Glu 


Ala 


Ala 


Gin 


Leu Gly Arg 


Lys 


Val 


50 








55 










60 






Ser 


Val 


Val 


Asp 


Tyr 


Val 


Glu 


Pro 


Ser 


Pro 


Gin 


Gly Thr Arg 


Trp 


Gly 
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65 

Leu Gly Gly Thr 

His Gin Ala Ala 

100 

Gly Trp Glu Val 
115 

Glu Ala Val Gin 
13 0 

Gin Leu Gin Asp 
145 

Val Asp Glu His 

Leu Leu Ser Ala 

180 

Tyr Pro Thr His 
195 

Asp lie Phe Trp 
210 

Ala Ser Tyr Val 
225 

Leu Asp Thr Thr 

Gin Gin Met Ser 

260 

Arg Phe Leu Arg 
275 

Gly Gin Leu Gin 
290 

Thr Gly Thr Phe 
305 

Thr Arg Ser Leu 

Thr Gin Lys lie 

340 

lie Tyr Ala He 
355 

Thr Ala He Met 
370 

Ser Ser Asp Leu 
385 

Pro Leu Glu Tyr 

Arg His Gly Gin 

42 0 

Leu Glu Phe Thr 
435 

Met Val Cys Leu 
450 

Leu Gly Pro Asn 
465 

Lys Cys Gly Ala 

Pro Thr Cys Ser 

500 

Gly Leu Asp Pro 
515 



70 

Cys Val Abu Val 
85 

Leu Leu Gly Gly 

Ala Gin Pro Val 

120 

Asn His Val Lys 
135 

Arg Lys Val Lys 
150 

Thr Val Cys Gly 
165 

Asp His He He 

He Glu Gly Ala 

200 

Leu Lys Glu Ser 
215 

Ala Leu Glu Cys 
230 

He Met Met Arg 
245 

Ser Met Val He 

Gly Cys Ala Pro 

280 

Val Thr Trp Glu 
295 

Asp Thr Val Leu 
310 

Asn Leu Glu Lys 
325 

Leu Val Asp Ser 

Gly Asp Val Val 

360 

Ala Gly Arg Leu 
375 

Met Asp Tyr Asp 
390 

Gly Cys Val Gly 
405 

Glu His Val Glu 

Val Ala Gly Arg 

440 

Arg Glu Pro Pro 
455 

Ala Gly Glu Val 
470 

Ser Tyr Ala Gin 
485 

Glu Glu Val Val 

Thr Val Thr Gly 

520 



75 

Gly Cys He Pro 
9.0 

Leu He Gin Asp 
105 

Pro His Asp Trp 

Ser Leu Asn Trp 

140 

Tyr Phe Asn He 
155 

Val Ala Lys Gly 
170 

He Ala Thr Gly 
185 

Leu Glu Tyr Gly 

Pro Gly Lys Thr 

220 

Ala Gly Phe Leu 
235 

Ser He Pro Leu 
250 

Glu His Met Ala 
265 

Ser Arg Val Lys 

Asp Ser Thr Thr 

300 

Trp Ala He Gly 
315 

Ala Gly Val Asp 
330 

Arg Glu Ala Thr 
345 

Glu Gly Arg Pro 

Leu Val Gin Arg 

380 

Asn Val Pro Thr 
395 

Leu Ser Glu Glu 
410 

Val Tyr His Ala 
425 

Asp Ala Ser Gin 

Gin Leu Val Leu 

460 

Thr Gin Gly Phe 
475 

Val Met Arg Thr 
490 

Lys Leu Arg He 

505 

Cys 



80 

LyB Lys Leu Met 
95 

Ala Pro Asn Tyr 
110 

Arg Lys Met Ala 
12 5 

Gly His Arg Val 

Lys Ala Ser Phe 

160 

Gly Lys Glu He 
175 

Gly Arg Pro Arg 
190 

He Thr Ser Asp 
205 

Leu Val Val Gly 

Thr Gly He Gly 

240 

Arg Gly Phe Asp 
255 

Ser His Gly Thr 
270 

Arg Leu Pro Asp 
285 

Gly Lys Glu Asp 

Arg Val Pro Asp 

320 

Thr Ser Pro Asp 
335 

Ser Val Pro His 
350 

Glu Leu Thr Pro 
365 

Leu Phe Gly Gly 

Thr Val Phe Thr 

400 

Glu Ala Val Ala 
415 

His Tyr Lys Pro 
430 

Cys Tyr Val Lys 
445 

Gly Leu His Phe 

Ala Leu Gly He 

4B0 

Val Gly He His 
495 

Ser Lys Arg Ser 
510 



<210> 299 
<211> 549 
<212> PRT 
<213> Homo sapien 

<400> 299 

Met Ser Cys Glu Asp Gly Arg Ala Leu Glu Gly Thr Leu Ser Glu Leu 
1 5 10 15 
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Ala Ala Glu Thr 

20 ' 

Gly Gly Hie Gly 
35 

Lys Leu Leu Lys 
50 

Tyr Asp Leu lie 
65 

Lys Glu Ala Ala 

Thr Pro Thr Pro 

100 

Asn Val Gly Cys 
115 

Gly Gin Ala Leu 
130 

Thr Val Lys His 
145 

He Gly Ser Leu 

Val Val Tyr Glu 

180 

Lys Ala Thr Asn 
195 

Phe Leu lie Ala 
210 

Asp Lys Glu Tyr 
225 

Cys Pro Gly Lys 

Cys Ala Gly Phe 

260 

Arg Ser He Leu 
275 

Gly Glu His Met 
230 

Pro He Lys Val 
305 

Val Val Ala Gin 

Asn Thr Val Met 

340 

Gly Leu Glu Thr 
355 

Pro Val Thr Asp 
370 

Gly Asp He Leu 
385 

Ala Gly Arg Leu 

Cys Asp Tyr Glu 

42 0 

Gly Ala Cys Gly 
435 

Glu Asn He Glu 
450 

He Pro Ser Arg 
465 

Thr Lys Asp Asn 

Ala Gly Glu Val 

500 

Thr Lys Lys Gin 
515 

Glu Val Phe Thr 
530 

Leu Gin Ala Gly 



Asp Leu Pro Val 

Pro Thr Leu Lys 

40 

Met Asn Gly Pro 
55 

He He Gly Gly 
70 

Gin Tyr Gly Lys 
85 

Leu Gly Thr Arg 

He Pro Lys Lys 

12 0 

Gin Asp Ser Arg 
135 

Asp Trp Asp Arg 
150 

Asn Trp Gly Tyr 
165 

Asn Ala Tyr Gly 

Asn Lys Gly Lys 

200 

Thr Gly Glu Arg 
215 

Cys He Ser Ser 
230 

Thr Leu Val Val 
245 

Leu Ala Gly He 

Leu Arg Gly Phe 

280 

Glu Glu His Gly 
295 

Glu Gin He Glu 
310 

Ser Thr Asn Ser 
325 

Leu Ala He Gly 

Val Gly Val Lys 

360 

Glu Glu Gin Thr 
375 

Glu Asp Lys Val 
390 

Leu Ala Gin Arg 
405 

Asn Val Pro Thr 

Leu Ser Glu Glu 

440 

Val Tyr His Ser 
455 

Asp Asn Asn Lys 
470 

Glu Arg Val Val 
485 

Thr Gin Gly Phe 

Leu Asp Ser Thr 

520 

Thr Leu Ser Val 
535 

Cys 



Val Phe Val Lys 
25 

Ala Tyr Gin Glu 

Glu Asp Leu Pro 

60 

Gly Ser Gly Gly 
75 

Lys Val Met Val 
90 

Trp Gly Leu Gly 
105 

Leu Met His Gin 

Asn Tyr Gly Trp 

140 

Met He Glu Ala 
155 

Arg Val Ala Leu 
170 

Gin Phe He Gly 
185 

Glu Lys He Tyr 

Pro Arg Tyr Leu 

220 

Asp Asp Leu Phe 
235 

Gly Ala Ser Tyr 
250 

Gly Leu Asp Val 
265 

Asp Gin Asp Met 

He Lys Phe He 

300 

Ala Gly Thr Pro 
315 

Glu Glu He He 
330 

Arg Asp Ala Cys 
345 

He Asn Glu Lys 

Asn Val Pro Tyr 

380 

Glu Leu Thr Pro 
395 

Leu Tyr Ala Gly 
410 

Thr Val Phe Thr 
425 

Lys Ala Val Glu 

Tyr Phe Trp Pro 

460 

Cys Tyr Ala Lys 
475 

Gly Phe His Val 
490 

Ala Ala Ala Leu 
505 

He Gly He His 

Thr Lys Arg Ser 

540 



Gin 


Arg 


Lys 


He 




30 






Gly Arg 


Leu 


Gin 


45 








Lys 


Ser 


Tyr 


Asp 


Leu 


Ala 


Ala 


Ala 








80 


Leu 


Asp 


Phe 


Val 






95 




Gly 


Thr 


Cys 


Val 




110 






Ala 


Ala 


Leu 


Leu 


125 








Lys 


Val 


Glu 


Glu 


Val 


Gin 


Asn 


His 








160 


Arg 


Glu 


Lys 


Lys 






175 




Pro 


His 


Arg 


He 




190 






Ser 


Ala 


Glu 


Arg 


205 








Gly 


He 


Pro 


Gly 

•mi 


Ser 


Leu 


Pro 


Tyr 








240 


Val 


Ala 


Leu 


Glu 






255 




Thr 


Val 


Met 


Val 




270 






Ala 


Asn 


Lys 


He 


285 








Arg 


Gin 


Phe 


Val 


Gly Arg 


Leu 


Arg 








320 


Glu Gly 


Glu 


Tyr 






335 




Thr Arg 


Lys 


He 




350 






Thr 


Gly 


Lys 


He 


365 








He 


Tyr 


Ala 


He 


Val 


Ala 


He 


Gin 








400 


Ser 


Thr 


Val 


Lys 






415 




Pro 


Leu 


Glu 


Tyr 




430 






Lys 


Phe 


Gly 


Glu 


445 








Leu 


Glu 


Trp 


mm m 1 

Thr 


He 


He 


Cys 


Asn 






480 


Leu 


Gly 


Pro 


Asn 






495 




Lys 


Cys 


Gly 


Leu 




510 






Pro 


Val 


Cys 


Ala 


525 








Gly Ala 


Ser 


He 
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<210> 300 
<211> 613 
<212> PRT 

<213> Mus musculus 
<220> 

<221> VARIANT 
<222> 612 

<223> Xaa = Any Amino Acid 



<400> 300 



Met 
1 


Pro 


Val 


Asp 


Asp 
5 


Cys 


Trp 


Leu 


Thr 


Pne 


Val 


Gin 


Thr 


Val 


Trp 


Val 








20 








Trp 


Pne 


Pro 


Gly Phe 


Leu 


Pro 


Pro 






35 










40 


Arg 


val 


Leu 


Leu 


Arg 


Gly 


Pro Arg 




50 










55 




Pro 


Ser 


Lys 


Thr 


Leu 


Pro 


Ser 


Ser 


65 










70 






Pro 


Cys 


lie 


Cys 


Pro 


Pro 


Pro 


Ser 








B5 








Asn 


Thr 


Gin 


Ser 


Glu 


Leu 


Pro 


Asn 








100 










Pro 


Thr 


Met 


Asn 


Gly 


Ser 


Lys 


Asp 






115 










120 


Leu 


lie 


lie 


lie 


Gly 


Gly 


Gly 


Ser 




130 










135 




Ala 


Ala 


Lys 


Phe 


Asp 


Lys 


Lys 


Val 


145 










1-50 






Thr 


Pro 


Leu 


Gly 


Thr 


Arg 


Trp 


Gly 










165 








Gly 


Cys 


lie 


Pro 


Lys 


Lys 


Leu 


Met 








180 










Ala 


Leu 


Lys 


Asp 


Ser 


Arg 


Asn 


Tyr 






195 










200 


Lys 


•His 


Asp 


Trp 


Glu 


Lys 


Met 


Thr 




210 










215 




Ser 


Leu 


Asn 


Trp 


Gly 


Tyr 


Arg 


Val 


225 










230 






Tyr 


Glu 


Asn 


Ala 


Tyr 


Gly 


Arg 


Phe 










245 








Thr 


Asn 


Asn 


Lys 


Gly 


Lys 


Glu 


Lys 








260 










lie 


Ala 


Thr 


Gly Glu 


Arg 


Pro 


Arg 






275 










280 


Glu 


Tyr 


Cys 


He 


Ser 


Ser 


Asp 


Asp 




290 










295 




Gly 


Lys 


Thr 


Leu 


Val 


Val 


Gly Ala 


305 










310 






Gly 


Phe 


Leu 


Ala 


Gly 


He 


Gly 


Leu 










325 








lie 


Leu 


Leu 


Arg 


Gly 


Phe 


Asp 


Gin 








340 










His 


Met 


Glu 


Glu 


His 


Gly 


He 


Lys 






355 










360 


Lys 


lie 


Glu 


Gin 


He 


Glu 


Ala 


Gly 




370 










375 




Ala 


Gin 


Ser 


Thr 


Asn 


Ser 


Glu 


Glu 


385 










390 






Val 


Leu 


Leu 


Ala 


Val 


Gly 


Arg 


Asp 










405 






Glu 


Thr 


Val 


Gly 


Val 


Lys 


He 


Asn 



Tyr 


Phe 


Pro 


Ala 


Ser 


Arg 


Gly Arg 




10 










15 




Ala 


Pro 


Thr 


Cys 


Pro 


Asn 


Cys 


Cys 


25 










30 






Val 


Pro 


Arg 


Pro 


Pro 


Has 


Val 


Pro 










45 








Gly Ala 


Val 


Leu 


Pro 


Ala 


Ser 


Arg 








60 








Ser 


Gin 


Thr 


Pro 


Cys 


Pro 


Thr 


Asp 






75 










80 


Thr 


Pro 


Asp 


Ser 


Arg 


Gin 


Glu 


Lys 




90 










95 




Lys 


Lys 


Gly Gin Leu 


Gin 


Lys 


Leu 


105 










110 






Pro 


Pro 


Gly 


Ser 


Tyr 


Asp 


Phe 


Asp 










125 








Gly Gly 


Leu 


Ala 


Ala 


Ala 


Lys 


Glu 








140 










Leu 


Val 


Leu 


Asp 


Phe 


Val 


Thr 


Pro 






155 










160 


Leu 


Gly 


Gly 


Thr 


Cys 


Val 


Asn 


Val 




170 










175 




His 


Gin 


Ala 


Ala 


Leu 


Leu 


Gly 


Gin 


185 










190 




Gly Trp 


Lys 


Val 


Glu 


Asp 


Thr 


Val 










205 








Glu 


Ser 


Val 


Gin 


Ser 


His 


He 


Gly 








220 








Ala 


Leu 


Arg 


Glu 


Lys 


Lys 


Val 


Val 






235 










240 


He 


Gly 


Pro 


His 


Arg 


He 


Val 


Ala 




250 










255 




He 


Tyr 


Ser 


Ala 


Glu 


Arg 


Phe 


Leu 


265 










270 






Tyr 


Leu 


Gly 


He 


Pro 


Gly Asp 


Lys 










285 








Leu 


Phe 


Ser 


Leu 


Pro 


Tyr 


Cys 


Pro 








300 










Ser 


Tyr 


Val 


Ala 


Leu 


Glu 


Cys 


Ala 






315 










320 


Asp 


Val 


Thr 


Val 


Met 


Val 


Arg 


Ser 




330 










335 




Asp 


Met 


Ala 


Asn 


Lys 


He 


Gly 


Glu 


345 










350 






Phe 


He 


Arg 


Gin 


Phe 


Val 


Pro 


Thr 










365 








Thr 


Pro 


Gly Arg 


Leu 


Arg 


Val 


Thr 








380 










Thr 


He 


Glu Gly Glu 


Phe 


Asn 


Thr 






395 










400 


Ser 


Cys 


Thr 


Arg 


Thr 


He 


Gly 


Leu 




410 










415 




Glu 


Lys 


Thr 


Gly Lys 


He 


Pro 


Val 
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420 

Thr Asp Glu Glu 
435 

lie Leu Glu Gly 
45 0 

Arg Leu Leu Ala 
465 

Tyr Asp Asn Val 

Cys Gly Leu Sear 

500 

lie Glu Val Tyr 
515 

Ser Arg Asp Asn 
530 

Asp Asp Glu Arg 
545 

Glu Val Thr Gin 

Gin Gin Leu Asp 

580 

Phe Thr Thr Leu 
595 

Ser Gly Cys Xaa 
610 



Gin Thr Asn Val 

440 

Lys Leu Glu Leu 
455 

Gin Arg Leu Tyr 
470 

Pro Thr Thr Val 
485 

Glu Glu Lys Ala 

His Ser Phe Phe 

520 

Asn Lys Cys Tyr 
535 

Val Val Gly Phe 
550 

Gly Phe Ala Ala 
565 

Ser Thr lie Gly 

Ser Val Thr Lys 

600 

Gly 



425 

Pro Tyr lie Tyr 

Thr Pro Val Ala 

460 

Gly Gly Ser Asn 
475 

Phe Thr Pro Leu 
490 

Val Glu Lys Phe 
505 

Trp Pro Leu Glu 

Ala Lys lie lie 

540 

His Val Leu Gly 
555 

Ala Leu Lys Cys 
570 

lie His Pro Val 
585 

Arg Ser Gly Gly 



430 

Ala lie Gly Asp 
445 

He Gin Ala Gly 

Val Lys Cys Asp 

480 

Glu Tyr Gly Cys 
495 

Gly Glu Glu Asn 
510 

Trp Thr Val Pro 
525 

Cys Asn Leu Lys 

Pro Asn Ala Gly 

560 

Gly Leu Thr Lys 
575 

Cys Ala Glu He 
590 

Asp He Leu Gin 
605 



<210> 301 
<211> 310 
<212> PRT 

<213> Mus musculus 
<400> 301 



Met 


Asn 


Gly 


Ser 


Lys 


Asp 


Pro 


Pro 


Gly Ser 


Tyr Asp 


Phe 


Asp 


Leu 


He 


1 








5 










10 










15 




He 


■He 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 


Leu 


Ala 


Ala 


Ala 


Lys 


Glu 


Ala 


Ala 








20 










25 










30 






Lys 


Phe 


Asp 


Lys 


Lys 


Val 


Leu 


Val 


Leu 


Asp 


Phe 


Val 


Thr 


Pro 


Thr 


Pro 






35 










40 








45 








Leu 


Gly 


Thr 


Arg 


Trp 


Gly 


Leu 


Gly Gly Thr 


Cys 


Val 


Asn 


Val 


Gly 


Cys 




50 










55 










60 










He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu 


Gly 


Gin 


Ala 


Leu 


65 










70 










75 










80 


Lys 


Asp 


Ser 


Arg 


Asn 


Tyr 


Gly 


Trp 


Lys 


Val 


Glu 


Asp 


Thr 


Val 


Lys 


His 










85 










90 










95 




Asp 


Trp 


Glu 


Lys 


Met 


Thr 


Glu 


Ser 


Val 


Gin 


Ser 


His 


He 


Gly Ser 


Leu 








100 










105 










110 






Asn 


Trp 


Gly 


Tyr 


Arg 


Val 


Ala 


Leu 


Arg 


Glu 


Lys 


Lys 


Val 


Val 


Tyr 


Glu 






115 










120 










125 








Asn 


Ala 


Tyr 


Gly 


Arg 


Phe 


He 


Gly 


Pro 


His 


Arg 


He 


Val 


Ala 


Thr 


Asn 




130 










135 










140 










Asn 


Lys 


Gly 


Lys 


Glu 


Lys 


He 


Tyr 


Ser 


Ala 


Glu 


Arg 


Phe 


Leu 


He 


Ala 


145 










150 










155 










160 


Thr 


Gly 


Glu 


Arg 


Pro 


Arg 


Tyr 


Leu 


Gly 


He 


Pro Gly Asp 


Lys 


Glu 


Tyr 










165 










170 










175 




Cys 


He 


Ser 


Ser 


Asp 


Asp 


Leu 


Phe 


Ser 


Leu 


Pro 


Tyr 


Cys 


Pro 


Gly 


Lys 








180 










185 










190 






Thr 


Leu 


Val 


Val 


Gly Ala 


Ser 


Tyr Val Ala 


Leu 


Glu 


Cys 


Ala Gly 


Phe 






195 










200 










205 








Leu 


Ala 


Gly 


He 


Gly Leu Asp Val 


Thr 


Val 


Met 


Val 


Arg 


Ser 


He 


Leu 




210 










215 










220 










Leu 


Arg 


Gly 


Phe 


Asp 


Gin 


Asp 


Met 


Ala 


Asn 


Lys 


He 


Gly 


Glu 


His 


Met 


225 










230 










235 










240 


Glu' Glu 


His 


Gly 


He 


Lys 


Phe 


He 


Arg 


Gin 


Phe 


Val 


Pro 


Thr 


Lys 


He 








245 








250 










255 




Glu 


Gin 


He 


Glu 


Ala 


Gly Thr 


Pro 


Gly Arg 


Leu 


Arg 


Val 


Thr 


Ala 


Gin 








260 










265 










270 
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Ser Thr Asn Ser Glu Glu Thr He Glu Gly Glu Phe Asn Thr Val Leu 

275 280 285 

Leu Ala Val Gly Arg Asp Ser Cys Thr Arg Thr lie Gly Leu Glu Thr 

290 " 295 300 

Val Gly Val Lys He Asn 
305 310 



<210> 302 
<211> 613 
<212> PRT 

<213> Mus musculus 



<400> 302 



Met 


Ser 


Ser 


Pro 


Pro Gly 


Arg 


Arg 


Ala 


Arg 


Leu 


Ala 


Ser 


Pro 


Gly 


Thr 


1 








5 










10 










15 




Ser 


Arg 


Pro 


Ser 


Ser 


Glu 


Ala 


Arg 


Glu 


Glu 


Leu 


Arg 


Arg 


Arg 


Leu 


Arg 






20 










25 










30 






Asp 


Leu 


He 


Glu 


Gly Asn 


Arg 


Val 


Met 


lie 


Phe 


Ser 


Lys 


Ser 


Tyr 


Cys 




35 










40 










45 








Pro 


His 


Ser 


Thr 


Arg 


Val 


Lys 


Glu 


Leu 


Phe 


Ser 


Ser 


Leu 


Gly Val 


Val 




50 








55 










60 










Tyr 


Asn 


He 


Leu 


Glu 


Leu 


Asp 


Gin 


Val 


Asp 


Asp 


Gly Ala 


Ser 


Val 


Gin 


65 










70 










75 










80 


Glu 


Val 


Leu 


Thr 


Glu 


He 


Ser 


Asn 


Gin 


Lys 


Thr 


Val 


Pro 


Asn 


lie 


Phe 










85 










90 










95 


His 


Val 


Asn 


Lys 


Val 


His 


Val 


Gly 


Gly 


Cys 


Asp 


Arg 


Thr. 


Phe 


Gin 


Ala 






100 










105 










110 






Gin 


Asn 


Gly 
115 


Leu 


Leu 


Gin 


Lys 


Leu 
120 


Leu 


Gin 


Asp 


Asp 


Ser 
125 


Ala 


His 


Asp 


Tyr 


Asp 


Leu 


He 


He 


He 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 


Leu 


Ser 


Cys 


Ala 


130 










135 










14 0 








Val 


Lys 


Glu 


Ala 


Ala 


Asn 


Leu 


Gly 


Lys 


Lys 


Val 


Met 


Val 


Leu 


Asp 


Phe 


145 










150 








155 










160 


Val 


Pro 


Ser 


Pro 


Gin Gly 


Thr 


Thr 


Trp 


Gly 


Leu 


Gly 


Gly 


Thr 


Cys 


Val 










165 










170 










175 




Aan 


Val 


Gly 


Cys 


He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu 






180 










185 










190 




Gin 


Gly 


His 


Ala 


Leu 


Gin 


Asp 


Ala 


Lys 


Lys 


Tyr 


Gly 


Trp 


Glu 


Tyr 


Asn 




195 










200 










205 






His 


Gin 


Val 


Lys 


His 


Asn Trp 


Glu 


Ala 


Met 


Thr 


Glu 


Ala 


He 


Gin 


Ser 




210 








215 










220 










_. -a 

He 


Gly 


Ser 


Leu 


Asn 


Trp 


Gly 


Tyr 


Arg 


Val 


Thr 


Leu Arg 


Glu 


Lys 




225 








230 










235 










240 


Val 


Thr 


Tyr 


Val 


Asn 


Ser 


Phe 


Gly 


Glu 


Phe 


Val 


Asp 


Leu 


His 


Lys 


He 








245 










250 










255 




Lys 


Ala 


Thr 


Asn 


Lys 


Lys 


Gly 


Gin 


GlU 


Thr 


Phe 


Tyr 


Thr 


Ala 


Ser 


Lys 






260 








265 










270 






Phe 


Val 


He 


Ala 


Thr 


Gly 


Glu 


Arg 


Pro 


Arg 


Tyr 


Leu Gly 


lie 


Gin 


Gly 






275 








280 










285 








Asp 


Lys 


Glu 


Tyr 


Cys 


He 


Thr 


Ser 


Asp 


Asp 


Leu 


Phe 


Ser 


Leu 


Pro 


Tyr 


290 










295 










300 










Cys 


Pro 


Gly 


Cys 


Thr 


Leu 


Val 


Val 


Gly Ala 


Ser 


Tyr Val 


Gly Leu 


Glu 


305 






310 










315 










320 


Cys 


Ala 


Gly 


Phe 


Leu 


Ala 


Gly 


Leu 


Gly 


Leu 


Asp 


Val 


Thr 


Val 


Met 


Val 






325 










330 










335 




Arg 


Ser 


Val 


Leu 


Leu 


Arg 


Gly 


Phe 


Asp 


Gin 


Glu 


Met 


Ala 


Glu 


Lys 


Val 






340 










345 










350 






Gly 


Ser 


Tyr 


Leu 


Glu 


Gin 


Gin 


Gly 


Val 


Lys 


Phe 


Gin Arg 


LyB 


Phe 


Thr 




355 










360 










365 








Pro 


He 
370 


Leu 


Val 


Gin 


Gin 


Leu 
375 


Glu 


Lys 


Gly 


Leu 


Pro 
380 


Gly 


Lys 


Leu 


Lys 


Val 


Val 


Ala 


Lys 


Ser 


Thr 


Glu 


Gly 


Pro 


Glu 


Thr 


Val 


Glu 


Gly 


He 


Tyr 


385 








390 








395 










400 


Asn 


Thr 


Val 


Leu 


Leu 
405 


Ala 


He 


Gly 


Arg 


Asp 
410 


Ser 


Cys 


Thr 


Arg 


Lys 
415 


He 


Gly 


Leu 


Glu 


Lys 


He 


Gly 


Val 


Lys 


He 


Asn 


Glu 


Lys 


Asn 


Gly 


Lys 


He 
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420 


Pro 


Val 


Asn 


Asp 






435 




Gly Asp 


lie 


Leu 




450 






Ala 


Gly 


Lys 


Leu 


465 








Cys 


Asp 


Tyr 


—r- -m 

lie 


Gly Cys 


Cys 


Gly 








500 


Glu 


Asn 


Lieu 


Glu 






515 




Val 


Ala 


Gly 


Arcr 




530 






Lys 


Phe 


Asp 


Asn 


545 








Ala Gly 


Glu 


He 


Thr 


Lys 


Gin 


Leu 








580 


Glu 


Val 


Phe 


Thr 






595 




Thr 


Gin 


Lys 


Gly 




610 







Val Glu Gin Thr 

440 

Asp Gly Lys Pro 
455 

Leu Ala Arg Arg 
470 

Asn He Pro Thr 
485 

Leu Ser Glu Glu 

Val Tyr His Thr 

520 

Asp Asn Asn Thr 
535 

Glu Arg Val Val 
550 

Thr Gin Gly Phe 
565 

Leu Asp Asp Thr 

Thr Leu Glu He 

600 

Cys 



425 

Asn Val Pro His 

Glu Leu Thr Pro 

460 

Leu Phe Gly Val 
475 

Thr Val Phe Thr 
490 

Lys Ala He Glu 
505 

Leu Phe Trp Pro 

Cys Tyr Ala Lys 

540 

Gly Phe His Leu 
555 

Ala Ala Ala Met 
570 

He Gly He His 
585 

Thr Lys Ser Ser 



430 

Val Tyr Ala He 
445 

Val Ala He Gin 

Ser Leu Glu Lys 

480 

Pro Leu Glu Tyr 
495 

Met Tyr Lys Lys 
510 

Leu Glu Trp Thr 
525 

He He Cys Asn 

Leu Gly Pro Asn • 

560 

Lys Cys Gly Leu 
575 

Pro Thr Cys Gly 
590 

Gly Leu Asp He 
605 



<210> 303 
<211> 524 
<212> PRT 

<213> Mus mus cuius 
<220> 

<221> VARIANT 
<222> 523 

<223> Xaa = Any Amino Acid 



<400> 303 



Met 


Val 


Ala 


Ala 


Met 


Val 


Ala 


Ala 


Leu 


Arg 


Gly 


Pro 


Ser 


Arg 


Arg 


Phe 


1 








5 










10 










15 




Arg 


Pro 


Arg 


Thr 


Arg 


Ala 


Leu 


Thr 


Arg 


Gly 


Thr 


Arg Gly Ala Ala 


Ser 








20 










25 










30 






Ala 


Ala 


Gly 


Gly 


Gin 


Gin 


Ser 


Phe 


Asp 


Leu 


Leu 


Val 


He Gly Gly Gly 






35 










40 










45 








Ser 


Gly 


Gly 


Leu 


Ala 


Cys 


Ala 


Lys 


Glu 


Ala 


Ala 


Gin 


Leu 


Gly 


Lys 


Lys 




50 










55 










60 




Val 


Ala 


Val 


Ala 


Asp 


Tyr 


Val 


Glu 


Pro 


Ser 


Pro 


Arg 


Gly Thr 


Lys 


Trp 


65 










70 










75 










80 


Gly 


Leu 


Gly 


Gly 


Thr 


Cys 


Val 


Asn 


Val 


Gly 


Cys 


He 


Pro 


Lys 


Lys 


Leu 










85 










90 










95 




Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu 


Gly Gly Met 


He 


Arg 


Asp 


Ala 


His 


His 








100 










105 










110 






Tyr Gly 


Trp 


Glu 


Val 


Ala 


Gin 


Pro 


Val 


Gin 


His 


Asn 


Trp 


Lys 


Thr 


Met 






115 










120 










125 






Ala 


Glu 


Ala 


Val 


Gin 


Asn 


His 


Val 


Lys 


Ser 


Leu 


Asn 


Trp 


Gly His 


Arg 




130 










135 










140 








Val 


Gin 


Leu 


Gin 


Asp 


Arg 


Lys 


Val 


Lys 


Tyr 


Phe 


Asn 


He 


Lys 


Ala 


Ser 


145 










150 










155 








160 


Phe 


Val 


Asp 


Glu 


His 


Thr 


Val 


Arg 


Gly Val 


Asp 


Lys 


Gly 


Gly 


Lys 


Ala 










165 










170 








• 


175 




Thr 


Leu. 


Leu 


Ser 


Ala 


Glu 


His 


He 


Val 


He 


Ala 


Thr 


Gly Gly Arg 


Pro 








180 










185 










190 






Arg 


Tyr 


Pro 


Thr 


Gin 


Val 


Lys 


Gly Ala Leu 


Glu 


Tyr 


Gly 


He 


Thr 


Ser 






195 










200 










205 








Asp 


Asp 


He 


Phe 


Trp 


Leu 


Lys 


Glu 


Ser 


Pro 


Gly 


Lys 


Thr 


Leu 


Val 


Val 




.210 










215 










220 










Gly Ala 


Ser 


Tyr 


Val 


Ala 


Leu 


Glu 


Cys 


Ala 


Gly 


Phe 


Leu 


Thr 


Gly 


He 
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225 










230 










235 










240 


Gly 


Leu 




Thr 


Thr 


Val 


Met 


Met 


Arg 


Ser 


He 


Pro 


Leu Arg 


Gly 


Phe 








245 










250 










255 




Asp 


Gin 


Gin 


Met 
260 


Ser 


Ser 


Leu 


Val 


Thr 
265 


Glu 


His 


Met 


Glu 


Ser 
270 


His 


Gly 


Thr 


Gin 


Phe 
275 


Leu 


Lvs 


Gly 


Cys 


Val 
280 


Pro 


Ser 


His 


He 


Lys 
285 


Lys 


Leu 


Pro 


Thr* 


Asn 
290 


Gin 


Leu 


Gin 


Val 


Thr 
295 


Trp 


Glu 


Asp 


His 


Ala 
300 


Ser 


Gly 


Lys 


Glu 


Agn 


Thr 


Glv 


Thr 


Phe 


Asp 


Thr 


Val 


Leu Trp 


Ala He Gly Arg Val 


Pro 


305 


• 






310 










315 










320 


Glu 

\J -L 1-1. 


Thr 




-L XX X. 


XJ Vw LX 


Asn 


Leu 


Glu 


Lvs 


Ala 


Gly 


He 


Ser 


Thr 


Asn 


Pro 








325 










330 










335 




Lvs 

j— i y fc* 


Asn 


Gin 


Lvs 


He 


He 


Val 


Asp 


Ala 


Gin 


Glu 


Ala 


Thr 


Ser 


Val 


Pro 






340 








345 










350 






His 


He 


Tvr 


Ala 


He 


Gly Asp Val 


Ala 


Glu 


Gly Arg 


Pro 


Glu 


Leu 


Thr 






355 










360 










365 








Pro 


Thr 


Ala 


He 


Lvs 


Ala 


Gly Lys 


Leu 


Leu 


Ala 


Gin 


Arg 


Leu 


Phe 


Gly 




370 










375 










380 










Lys 


Ser 


Ser 


Thr 


Ley 


Met 


Asp 


Tyr 


Ser 


Asn 


Val 


Pro 


Thr 


Thr 


Val 


Phe 


385 










390 










395 










400 


Thr 


Pro 


Leu 


Glu 


Tyr 


Gly Cys Val 


Gly 


Leu 


Ser 


Glu 


Glu 


Glu 


Ala 


Val 










405 










410 










415 




Ala 


Leu 


His 


Glv 
420 


Gin 


Glu 


His 


Val 


Glu 
425 


Val 


Tyr 


His 


Ala 


Tyr 
430 


Tyr 


Lys 




Leu 

J_J t_ \JL 


Glu 


Phe 


Thr 


Val 


Ala Asp 


Arg 


Asp 


Ala 


Ser 


Gin 


Cys 


Tyr 


He 






435 










440 










445 








Lys 


Met 


Val 


Cys 


Met 


Arg 


Glu 


Pro 


Pro 


Gin 


Leu 


Val 


Leu Gly 


Leu 


His 




450 










455 










460 










Phe 


Leu 


Gly 


Pro 


Asn Ala Gly Glu 


Val 


Thr 


Gin 


Gly 


Phe 


Ala 


Leu 


Gly 


465 










470 










475 










480 


lie 


Lys 


Cys 


Gly 


Ala 
485 


Ser 


Tyr 


Ala 


Gin 


Val 
490 


Met 


Gin 


Thr 


Val 


Gly 
495 


He 


His 


Pro 


Thr 


Cys 
500 


Ser 


Glu 


Glu 


Val 


Val 
505 


Lys 


Leu 


His 


He 


Ser 
510 


Lys 


Arg 


Ser 


Gly 


Leu 
515 


Glu 


Pro 


Thr 


Val 


Thr 
520 


Gly 


Cys 


Xaa 


Gly 











<210> 304 
<211> 528 
<212> PRT 
<213> Mus mus cuius 

<220> 

<221> VARIANT 
<222> 527 

<223> Xaa = Any Amino Acid 



<400> 304 



Met 


Ala 


Ala 


Met 


Val 


Ala Gly Arg 


Met 


Trp 


Ala 


Ala Leu 


Arg 


Gly 


Pro 


1 








5 






10 








15 




Ser 


Arg 


Arg 


Phe 


Arg 


Pro Arg Thr 


Arg 


Ala 


Leu 


Thr Arg 


Gly Thr Arg 








20 






25 








30 






Gly 


Ala 


Ala 


Ser 


Ala 


Ala Gly Gly 


Gin 


Gin 


Ser 


Phe Asp 


Leu 


Leu 


Val 






35 






40 








45 








He 


Gly 


Gly 


Gly Ser 


Gly Gly Leu 


Ala 


Cys 


Ala 


Lys Glu 


Ala 


Ala 


Gin 




50 








55 








60 








Leu 


Gly 


Lys 


Lys 


Val 


Ala Val Ala 


Asp 


Tyr 


Val 


Glu Pro 


Ser 


Pro 


Arg 


65 










70 






75 








80 


Gly 


Thr 


Lys 


Trp 


Gly 


Leu Gly Gly 


Thr 


Cys 


Val 


Asn Val 


Gly 


Cys 


He 










85 






90 








95 




Pro 


Lys 


Lys 


Leu 


Met 


His Gin Ala 


Ala 


Leu 


Leu Gly Gly 


Met 


He 


Arg 








100 






105 








110 






Asp 


Ala 


His 


His 


Tyr 


Gly Trp Glu 


Val 


Ala 


Gin 


Pro Val 


Gin 


His 


Asn 






115 






12 0 








125 








Trp 


Lys 


Thr 


Met 


Ala 


Glu Ala Val 


Gin 


Asn 


His 


Val Lys 


Ser 


Leu 


Asn 
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i ^ n 

jl j yj 










X. _J 










140 










ir P 


f2l -vr 


xx X fa 




Val 


Gin 


XJC n 


Gl n 


X4i3 LJ 


Arg 


Lys 


Val 


Lys 


Tyr 




A en 


XT J 










150 










155 










160 

_L_ \J y J 


Tip 




Hid 


OCX 


Phe 


Val 


Asp 


Gl ii 

V3XU 


xll s 


Thr Val Arg Gly Val 




xj y o 










165 










170 










175 




Gl v 


vjri y 




Al a 

xVl d 

180 

-1- V-* V 


Thr 


Leu 


Leu 


*J C 1 


A 1 a 

ftl £X 

185 


Glu 


His 


He 


Val 


He 

_jL 

190 


Ala 

1X1 *-«X- 


Thr 


VJJX Jf 


Gl V 


jmx y 


Pro 


Arg 


Tyr 


Pro 


Thr- 

J- 1 jV_L_ 


GJ n 


Val 


Lys 


Gly Ala 


XJ C IX 


Glu 


Tvr 

i y x 






195 










20 0 










2 05 

*-M \J l»J 








VJX jr 


11C 

210 

A» «A» V 


1 1X1 


O C 1 


Asp 


Asp 


He 
215 


Phe 


XXj\J 


Leu 


Lys 


Glu 
220 


Cpr 


Prn 


Glv 


Lvs 




XJC IX 


v a x 


val 


Gly Ala 


Ser 


Tyr 


Val 
v ax 


Ala 


Leu 


Glu 


Cvs 


Al a 


Glv 


Phe 

XT XX w 


225 










230 










235 










240 


lieu 


X I1X 


vjr 1 y 


11c 


Gly 


Leu Asp 


Thr 


XXlx 


Val 


Met 


Met 


Ara 

m- y 


t— ' V — - -L_ • 


Tie 

1 1 c 


Prn 










245 










250 










255 




JU6U 


i\rg 


uiy 


ir lie 




Gin 


Gin 


Met 


c e> >- 

JCl 

7 fie; 

t> U J 


Ser 


Leu 


Val 


Thr 


Glu 
270 


1X1 C3 


Met 

ric iw 


vjl IX 


C! d "V" 
□ CI 


til O 

ins 


Gl 


TVi r* 

X 111 


Gin 

V_J _L x x 


Phe 


Leu 


T.vc 


Gly Cys 


Val 


Pro 




His 


He 
















280 










285 








xjy a 


Jjy £3 


1ICU 


riu 


Tin t~ 

X XXI 


^\*3 X X 


Gin 


Leu 


Gl n 

will 


Val 


Thr 


xrp 


f3l 11 
vjj 1 IX 




His 


Ala 


290 










295 










300 










O p» -v- 

O CX 


VJX.J/ 


T ,VR 

j-i _y o 


Glu 




Thr 

**- l J x. 


Gly Thr 


Phe 


Asp 


Thr 


Va 1 

V ell 


XJC IX 


A - 1 - xr 


Ala 


He 


305 










310 










315 










320 


Glv 


A7~CT 


Val 

V d 1 


Jr X 


Glu 
325 




Arg 


Thr 




Asn 
330 


Leu 


Glu 


TjVS 

JLJJ/ 0 


Ala 


Gly 
335 


He 


Ser 

U V-> -i— 


Thr 




Pro 
340 


TiVS 


Asn 


Gin 


Lys 


He 

-J- «X> Vp* 

345 


He 


Val 


Asp 


Ala 


Gin 
350 


Glu 


Ala 


X 11X 


OCX 


V al 


XT X tJ 


His 


He 


Tyr Ala 


lie 


Gly Asp 


Val 


Ala 


Glu 


Gly Arg 






355 










360 










365 








JrxO 


IjlU 


XJCU 


X 1117 


Pro 


Thr 


Ala 


He 


xjy s 


Ala Gly 


Lys 


XJCU 


Tien 


Ala 


Gin 




-i7n 










375 










380 












bell 


±r ne 




Lys 


Ser 


Ser 


Thr 


Leu 


Met 


Asp 


Tyr 


Ser 




Val 


Pro 


385 










390 










395 










400 


J. XIX 


r P V\ -y~ 
X XXX 


V Gil 


xrXlc 


Thr 
405 


Pro 


Leu 


Glu 


Tyr 


Gly 
410 


Cys 


Val 


Gly 


XJC IX 


Ser 
415 


Glu 


gi ti 

ulU 


Gl n 

varl IX 


£\ X d 


Vn 1 
val 

420 


Ala 


Leu 


His 


Gly 


Gin 
425 


Glu 


His 


Val 


Glu 


Val 
430 


Tyr 


His 




±yr 


Tift* 

iyr 
435 


xjy s 


Pro 


Leu 


Glu 


Phe 
440 


Thr 


Val 


Ala 


Asp 


Arg 
445 


Asp 


Ala 


Ser 


Gin 


Cys 
450 


Tyr 


lie 


Lys 


Met 


Val 
455 


Cys 


Met 


Arg 


Glu 


Pro 
460 


Pro 


Gin 


Leu 


Val 


Leu 


Gly 


Leu 


His 


Phe 


Leu 


Gly 


Pro 


Asn Ala Gly Glu 


Val 


Thr 


Gin 


Gly 


465 










470 










475 










480 


Phe 


Ala 


Leu 


Gly 


He 
485 


Lys 


Cys 


Gly 


Ala 


Ser 
490 


Tyr 


Ala 


Gin 


Val 


Met 
495 


Gin 


Thr 


Val 


Gly 


He 
500 


His 


Pro 


Thr 


Cys 


Ser 
505 


Glu 


Glu 


Val 


Val 


Lys 
510 


Leu 


His 


lie 


Ser 


Lye 
515 


Arg 


Ser 


Gly 


Leu 


Glu 
520 


Pro 


Thr 


Val 


Thr 


Gly 
525 


Cys 


Xaa 


Gly 



<210> 305 
<211> 520 
<212> PRT 
<213> Mus 



mus cuius 



<400> 305 
Met Val Ala Ala 
1 

Arg Ala Leu Thr 

20 

Gin Gin Ser Phe 
35 

Ala Cys Ala Lys 
50 

Asp Tyr Val Glu 
65 



Leu Arg Gly Pro 
5 

Arg Gly Thr Arg 

Asp Leu Leu Val 

40 

Glu Ala Ala Gin 
55 

Pro Ser Pro Arg 
70 



Ser Arg Arg Phe 
10 

Gly Ala Ala Ser 
25 

He Gly Gly Gly 

Leu Gly Lys Lys 

60 

Gly Thr Lys Trp 
75 



Arg Pro Arg Thr 
15 

Ala Ala Gly Gly 
30 

Ser Gly Gly Leu 
45 

Val Ala Val Ala 

Gly Leu Gly Gly 

80 
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Thr Cys Val Asn 

Ala Leu Leu Gly 

100 

Val Ala Gin Pro 
115 

Gin Asn His Val 
130 

Asp Arg Lys Val 
145 

His Thr Val Arg 

Ala Glu His He 

180 

Gin Val Lys Gly 
195 

Trp Leu Lys Glu 
210 

Val Ala Leu Glu 
225 

Thr Val Met Met 

Ser Ser Leu Val 

260 

Lys Gly Cys Val 
275 

Gin Val Thr Trp 
290 

Phe Asp Thr Val 
305 

Leu Asn Leu Glu 

He He Val Asp 

340 

He Gly Asp Val 
355 

Lys Ala Gly Lys 
370 

Leu Met Asp Tyr 
385 

Tyr Gly Cys Val 

Gin Glu His Val 

420 

Thr Val Ala Asp 
435 

Met Arg Glu Pro 
450 

Asn Ala Gly Glu 
465 

Ala Ser Tyr Ala 

Ser Glu Glu Val 

500 

Pro Thr Val Thr 
515 



Val Gly Cys He 
85 

Gly Met He Arg 

Val Gin His Asn 

120 

Lys Ser Leu Asn 
135 

Lys Tyr Phe Asn 
150 

Gly Val Asp Lys 
165 

Val He Ala Thr 

Ala Leu Glu Tyr 

200 

Ser Pro Gly Lys 
215 

Cys Ala Gly Phe 
230 

Arg Ser He Pro 
245 

Thr Glu His Met 

Pro Ser His He 

2B0 

Glu Asp His Ala 
295 

Leu Trp Ala He 
310 

Lys Ala Gly He 
325 

Ala Gin Glu Ala 

Ala Glu Gly Arg 

360 

Leu Leu Ala Gin 
375 

Ser Asn Val Pro 
390 

Gly Leu Ser Glu 
405 

Glu Val Tyr His 

Arg Asp Ala Ser 

440 

Pro Gin Leu Val 
455 

Val Thr Gin .Gly 
470 

Gin Val Met Gin 
485 

Val Lys Leu His 

Gly Cys Cys Gly 

520 



Pro Lys Lys Leu 
90 

Asp Ala His His 
105 

Trp Lys Thr Met 

Trp Gly His Arg 

140 

He Lys Ala Ser 
155 

Gly Gly Lys Ala 
170 

Gly Gly Arg Pro 
185 

Gly lie Thr Ser 

Thr Leu Val Val 

220 

Leu Thr Gly He 
235 

Leu Arg Gly Phe 
250 

Glu Ser His Gly 
265 

Lys Lys Leu Pro 

Ser Gly Lys Glu 

300 

Gly Arg Val Pro 
315 

Ser Thr Asn Pro 
330 

Thr Ser Val Pro 
345 

Pro Glu Leu Thr 

Arg Leu Phe Gly 

380 

Thr Thr Val Phe 
395 

Glu Glu Ala Val 
410 

Ala Tyr Tyr Lys 
425 

Gin Cys Tyr He 

Leu Gly Leu His 

460 

Phe Ala Leu Gly 
475 

Thr Val Gly He 
490 

He Ser Lys Arg 
505 



Met His Gin Ala 
95 

Tyr Gly Trp Glu 
110 

Ala Glu Ala Val 
125 

Val Gin Leu Gin 

Phe Val Asp Glu 

160 

Thr Leu Leu Ser 
175 

Arg Tyr Pro Thr 
190 

Asp Asp He Phe 
205 

Gly Ala Ser Tyr 

Gly Leu Asp Thr 

240 

Asp Gin Gin Met 
255 

Thr Gin Phe Leu 
270 

Thr Asn Gin Leu 
285 

Asp Thr Gly Thr 

Glu Thr Arg Thr 

320 

Lys Asn Gin Lys 
335 

His He Tyr Ala 
350 

Pro Thr Ala He 
365 

Lys Ser Ser Thr 

Thr Pro Leu Glu 

400 

Ala Leu His Gly 
415 

Pro Leu Glu Phe 
430 

Lys Met Val Cys 
445 

Phe Leu Gly Pro 

He Lys Cys Gly 

480 

His Pro Thr Cys 
495 

Ser Gly Leu Glu 
510 



<210> 306 
<211> 499 
<212> PRT 

<213> Mus mus cuius 



<400> 306 

Met Asn Gly Ser Lys Asp Pro Pro 

1 5 
He He Gly Gly Gly Ser Gly Gly 



Gly Ser Tyr Asp Phe Asp Leu He 

10 15 
Leu Ala Ala Ala Lys Glu Ala Ala 
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20 

Lys Phe Asp Lys 
35 

Leu Gly Thr Arg 
50 

lie Pro Lys Lys 
65 

Lys Asp Ser Arg 

Asp Trp Glu Lys 

100 

Asn Trp Gly Tyr 
115 

Asn Ala Tyr Gly 
130 

Asn Lys Gly Lys 
145 

Thr Gly Glu Arg 

Cys lie Ser Ser 

180 

Thr Leu Val Val 
195 

Leu Ala Gly lie 
210 

Leu Arg Gly Phe 
225 

Glu Glu His Gly 

Glu Gin lie Glu 

260 

Ser Thr Asn Ser 
275 

Leu Ala Val Gly 
290 

Val Gly Val Lys 
305 

Glu Glu Gin Thr 

Glu Gly Lys Leu 

340 

Leu Ala Gin Arg 
355 

Asn Val Pro Thr 
370 

Leu Ser Glu Glu 
385 

Val Tyr His Ser 

Asp Asn Asn Lys 

420 

Glu Arg Val Val 
435 

Thr Gin Gly Phe 
450 

Leu Asp Ser Thr 
465 

Thr Leu Ser Val 
Cys Cys Gly 



Lys Val Leu Val 

40 

Trp Gly Leu Gly 
55 

Leu Met His Gin 

70 

Asn Tyr Gly Trp 
85 

Met Thr Glu Ser 

Arg Val Ala Leu 

120 

Arg Phe lie Gly 
135 

Glu Lys lie Tyr 
150 

Pro Arg Tyr Leu 
165 

Asp Asp Leu Phe 

Gly Ala Ser Tyr 

200 

Gly Leu Asp Val 
215 

Asp Gin Asp Met 
230 

lie Lys Phe lie 
245 

Ala Gly Thr Pro 

Glu Glu Thr lie 

280 

Arg Asp Ser Cys 
295 

lie Asn Glu Lys 
310 

Asn Val Pro Tyr 
325 

Glu Leu Thr Pro 

Leu Tyr Gly Gly 

360 

Thr Val Phe Thr 
375 

Lys Ala Val Glu 
390 

Phe Phe Trp Pro 
405 

Cys Tyr Ala Lys 

Gly Phe His Val 

440 

Ala Ala Ala Leu 
455 

lie Gly He His 
470 

Thr Lys Arg Ser 
485 



25 

Leu Asp Phe Val 

Gly Thr Cys Val 

60 

Ala Ala Leu Leu 
75 

Lys Val Glu Asp 
90 

Val Gin Ser His 
105 

Arg Glu Lys Lys 

Pro His Arg He 

140 

Ser Ala Glu Arg 
155 

Gly He Pro Gly 
170 

Ser Leu Pro Tyr 
185 

Val Ala Leu Glu 

Thr Val Met Val 

220 

Ala Asn Lys He 
235 

Arg Gin Phe Val 
250 

Gly Arg Leu Arg 
265 

Glu Gly Glu Phe 

Thr Arg Thr He 

300 

Thr Gly Lys He 
315 

He Tyr Ala He 
330 

Val Ala He Gin 
345 

Ser Asn Val Lys 

Pro Leu Glu Tyr 

380 

Lys Phe Gly Glu 
395 

Leu Glu Trp Thr 
410 

He He Cys Asn 
425 

Leu Gly Pro Asn 

Lys Cys Gly Leu 

460 

Pro Val Cys Ala 
475 

Gly Gly Asp He 
490 



30 

Thr Pro Thr Pro 
45 

Asn Val Gly Cys 

Gly Gin Ala Leu 

80 

Thr Val Lys His 
95 

He Gly Ser Leu 
110 

Val Val Tyr Glu 
125 

Val Ala Thr Asn 

Phe Leu He Ala 

160 

Asp Lys Glu Tyr 
175 

Cys Pro Gly Lys 
190 

Cys Ala Gly Phe 
205 

Arg Ser He Leu 

Gly Glu His Met 

240 

Pro Thr Lys He 
255 

Val Thr Ala Gin 
270 

Asn Thr Val Leu 
285 

Gly Leu Glu Thr 

Pro Val Thr Asp 

320 

Gly Asp He Leu 
335 

Ala Gly Arg Leu 
350 

Cys Asp Tyr Asp 
365 

Gly Cys Cys Gly 

Glu Asn He Glu 

400 

Val Pro Ser Arg 
415 

Leu Lys Asp Asp 
43 0 

Ala Gly Glu Val 
445 

Thr Lys Gin Gin 

Glu He Phe Thr 

480 

Leu Gin Ser Gly 
4 95 



<210> 307 
<211> 497 
<212> PRT 
<213> Rattus 



norvegicus 
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<220> 

<221> VARIANT 
<222> 497 

<223> Xaa = Any Amino Acid 



<400> 307 



Met 


Asn 


Asp 


Ser 


Lys 


Asp 


Ala 


Pro 


Lys 


Ser 


Tyr 


Asp 


Phe 


Asp 


Leu 


He 


1 








5 










10 










15 




lie 


lie 


Gly 


Gly 


Gly 


Ser 


Gly Gly 


Leu 


Ala 


Ala 


Ala 


Lys 


Glu 


Ala 


Ala 








20 










25 










30 






Lys 


Phe 


Asp 


Lys 


Lys 


Val 


Met 


Val 


Leu 


Asp 


Phe 


Val 


Thr 


Pro 


Thr 


Pro 






35 










40 










45 








Leu 


Gly 


Thr 


Asn 


Gly 


Gly 


Leu Gly 


Gly 


Thr 


Cys 


Val 


Asn 


Val 


Gly 


Cys 




50 










55 










60 










lie 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu 


Gly Gin Ala 


Leu 


65 








70 










75 










80 


Lys 


Asp 


Ser 


Arg 


Asn 


Tyr 


Gly Trp 


Lys 


Leu 


Glu 


Asp 


Thr 


Val 


Lys 


His 










85 










90 










95 




Asp 


Trp 


Glu 


Lys 


Met 


Thr 


Glu 


Ser 


Val 


Gin 


Asn 


His 


He 


Gly 


Ser 


Leu 








100 










105 










110 






Asn 


Trp 


Gly 


Tyr 


Arg 


Val 


Ala 


Leu 


Arg 


Glu 


Lys 


Lys 


Val 


Val 


Tyr 


Glu 






115 










120 










125 








Asn 


Ala 


Tyr 


Gly 


Lys 


Phe 


He 


Gly 


Pro 


His 


Lys 


He 


Met 


Ala 


Thr 


Asn 




130 










135 










140 










Asn 


Lys 


Gly 


Lys 


Glu 


Lys 


Val 


Tyr 


Ser 


Ala 


Glu 


Arg 


Phe 


Leu 


He 


Ala 


145 










150 










155 










160 


Thr Gly Glu 


Arg 


Pro 


Arg 


Tyr 


Leu 


Gly 


He 


Pro 


Gly 


Asp 


Lys 


Glu 


Tyr 










165 










170 










175 




Cys 


He 


Ser 


Ser 


Asp 


Asp 


Leu 


Phe 


Ser 


Leu 


Pro 


Tyr 


Cys 


Pro 


Gly 


Lys 








180 










185 










190 






Thr 


Leu 


Val 


Val 


Gly 


Ala 


Ser 


Tyr 


Val 


Ala 


Leu 


Glu 


Cys 


Ala 


Gly 


Phe 






195 










200 










205 








Leu Ala Gly 


He 


Gly 


Leu 


Asp Val 


Thr 


Val 


Met 


Val 


Arg 


Ser 


He 


Leu 




210 










215 










220 










Leu 


Arg 


Gly 


Phe 


Asp 


Gin 


Asp 


Met 


Ala 


Asn 


Lys 


He 


Gly 


Glu 


His 


Met 


225 










230 










235 










240 


Glu 


Glu 


His 


Gly 


He 


Lys 


Phe 


He 


Arg 


Gin 


Phe 


Val 


Pro 


Thr 


Lys 


He 










245 










250 










255 




Glu 


Gin 


He 


Glu 


Ala 


Gly 


Thr 


Pro 


Gly 


Arg 


Leu 


Lys 


Val 


Thr 


Ala 


Lys 








260 










265 










270 






Ser 


Thr 


Asn 


Ser 


Glu 


Glu 


Thr 


He 


Glu 


Asp 


Glu 


Phe 


Asn 


Thr 


Val 


Leu 






275 










2B0 








285 








Leu 


Ala 


Val 


Gly 


Arg 


Asp 


Ser 


Cys 


Thr 


Arg 


Thr 


He 


Gly 


Leu 


Glu 


Thr 




290 










295 










300 










Val 


Gly 


Val 


Lys 


He 


Asn 


Glu 


Lys 


Thr 


Gly 


Lys 


He 


Pro 


Val 


Thr 


Asp 


305 










310 










315 










320 


Glu 


Glu 


Gin 


Thr 


Asn 


Val 


Pro 


Tyr 


He 


Tyr 


Ala 


He 


Gly Asp 


He 


Leu 










325 










330 










335 




Glu 


Gly Lys 


Leu 


Glu 


Leu 


Thr 


Pro 


Val 


Ala 


He 


Gin 


Ala 


Gly Arg 


Leu 








340 










345 










350 






Leu 


Ala 


Gin 


Arg 


Leu 


Tyr 


Gly Gly 


Ser 


Thr 


Val 


Lys 


Cys 


Asp 


Tyr 


Asp 






355 










360 










365 








Asn 


Val 


Pro 


Thr 


Thr 


Val 


Phe 


Thr 


Pro 


Leu 


Glu 


Tyr 


Gly 


Cys 


Cys 


Gly 




370 










375 










380 










Leu 


Ser 


Glu 


Glu 


Lys 


Ala 


Val 


Glu 


Lys 


Phe 


Gly 


Glu 


Glu 


Asn 


He 


Glu 


385 










390 










395 










400 


Val 


Tyr 


His 


Ser 


Phe 


Phe 


Trp 


Pro 


Leu 


Glu 


Trp 


Thr 


Val 


Pro 


Ser 


Arg 










405 










410 










415 




Asp 


Asn 


Asn 


Lys 


Cys 


Tyr 


Ala 


Lys 


Val 


He 


Cys 


Asn 


Leu 


Lys 


Asp 


Asn 








420 










425 










430 






Glu Arg 


Val 


Val 


Gly 


Phe 


His 


Val 


Leu 


Gly 


Pro 


Asn 


Ala Gly Glu Val 






435 










440 








445 








Thr 


Gin 


Gly 


Phe 


Ala 


Ala 


Ala 


Leu 


Lys 


Cys 


Gly 


Leu 


Thr 


Lys 


Gin 


Gin 




450 










455 








460 










Leu Asp 


Ser 


Thr 


He 


Gly 


He 


His 


Pro 


Val 


Cys 


Ala 


Glu 


He 


Phe 


Thr 


465 










470 










475 










480 


Thr 


Leu 


Ser 


Val 


Thr 


Lys 


Arg 


Ser 


Gly 


Gly Asp 


He 


Leu 


Gin 


Ser 


Gly 
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485 



Xaa 



490 



495 



<210> 308 
<21X> 176 
<212> PRT 

<213> Rat tua norvegicua 



<400> 308 



Arg 


lie 


His 


Ala 


Gly 


Gly 


Ala 


Gly Arg 


Arg Arg 


Gly 


Gly Ala 


Arg Arg 


1 








5 








10 










15 




Ala 


Gly Val 


Phe 


He 


Leu 


Leu 


Ala His 


Pro 


Asn 


Lys 


Lys 


Gly 


Leu 


Leu 








20 








25 








30 






Arg 


Lys 


Leu 
35 


Ser 


Thr 


Met 


Asn 


Asp Ser 
40 


Lys 


Asp 


Ala 


Pro 
45 


Lys 


Ser 


Tyr 


Asp 


Phe 


Asp 


Leu 


He 


He 


He 


Gly Gly Gly Ser 


Gly Gly 


Leu 


Ala 


Ala 




50 










55 








60 










Ala 


Lys 


Glu 


Ala 


Ala 


Lys 


Phe 


Asp Lys 


Lys 


Val 


Met 


Val 


Leu 


Asp 


Phe 


65 










70 








75 










80 


Val 


Thr 


Pro 


Thr 


Pro 
85 


Leu 


Gly 


Thr Arg 


Trp 
90 


Gly 


Leu 


Gly 


Gly 


Thr 
95 


Cys 


Val 


Asn 


Val 


Gly 
100 


Cys 


He 


Pro 


Lys Lys 
105 


Leu 


Met 


His 


Gin 


Ala 
110 


Ala 


Leu 


Leu Gly Gin Ala 


Leu 


Lys 


Asp 


Ser Arg 


Asn 


Tyr 


Gly 


Trp 


Lys 


Leu 


Glu 






115 










120 








125 








Asp 


Thr 
130 


Val 


Lys 


His 


Asp 


Trp 
135 


Glu Lys 


Met 


Thr 


Glu 
140 


Ser 


Val 


Gin 


Asn 


His 


lie 


Gly 


Ser 


Leu 


Asn 


Trp 


Gly Tyr 


Arg 


Val 


Ala 


Leu 


Arg 


Glu 


Lys 


145 










150 








155 










160 


Lys 


Val 


Val 


Tyr 


Glu 

165 


Asn 


Ala 


Tyr Gly 


Lys 
170 


Phe 


He 


Gly 


Pro 


His 
175 


Lys 
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<400> 309 



Met 


Asn 


Asp 


Ser 


Lys 


Asp 


Ala 


Pro 


Lys 


Ser 


Tyr 


Asp Phe 


Asp 


Leu 


He 


1 








5 










10 








15 




He 


He 


Gly 


Gly 


Gly 


Ser 


Gly 


Gly 


Leu 


Ala 


Ala 


Ala Lys 


Glu 


Ala 


Ala 








20 










25 








30 






Lys 


Phe 


Asp 


Lys 


Lys 


Val 


Met 


Val 


Leu 


Asp 


Phe 


Val Thr 


Pro 


Thr 


Pro 






35 










40 








45 








Leu 


Gly 


Thr 


Asn 


Gly 


Gly 


Leu 


Gly Gly Thr Cys Val Asn 


Val 


Gly 


Cys 




50 










55 










60 








He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu Gly Gin 


Ala 


Leu 


65 










70 










75 








80 


Lys 


Asp 


Ser 


Arg 


Asn 


Tyr 


Gly 


Trp 


Lys 


Leu 


Glu 


Asp Thr 


Val 


Lys 


His 










85 










90 








95 




Asp 


Trp 


Glu 


Lys 


Met 


Thr 


Glu 


Ser 


Val 


Gin 


Asn 


His He 


Gly 


Ser 


Leu 








100 










105 








110 






Asn 


Trp 


Gly 


Tyr 


Arg 


Val 


Ala 


Leu 


Arg 


Glu 


Lys 


Lys Val 


Val 


Tyr 


Glu 






115 










120 








125 






Asn 


Ala 


Tyr 


Gly 


Lys 


Phe 


He 


Gly 


Pro 


His 


Lys 


He Met 


Ala 


Thr 


Asn 




130 










135 










140 








Asn 


Lys 


Gly 


Lys 


Glu 


Lys 


Val 


Tyr 


Ser 


Ala 


Glu Arg Phe 


Leu 


He 


Ala 


145 










150 










155 








160 


Thr 


Gly 


Glu 


Arg 


Pro 


Arg 


Tyr 


Leu 


Gly 


He 


Pro 


Gly Asp 


Lys 


Glu 


Tyr 
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Cys lie Ser Ser 

180 

Thr Leu Val Val 
195 

Leu Ala Gly lie 
210 

Leu Arg Gly Phe 
225 

Glu Glu His Gly 

Glu Gin He Glu 

260 

Ser Thr Asn Ser 
275 

Leu Ala Val Gly 
290 

Val Gly Val Lys 
305 

Glu Glu Gin Thr 

Glu Gly Lys Leu 

340 

Leu Ala Gin Arg 
355 

Asn Val Pro Thr 
370 

Leu Ser Glu Glu 
385 

Val Tyr His Ser 

Asp Asn Asn Lys 

420 

Glu Arg Val Val 
435 

Thr Gin Ala Leu 
450 

Asp Ser Thr He 
465 

Leu Ser Val Thr 
Xaa Gly 



165 

Asp Asp Leu Phe 

Gly Ala Ser Tyr 

200 

Gly Leu Asp Val 
215 

Asp Gin Asp Met 
230 

He Lys Phe He 
245 

Ala Gly Thr Pro 

Glu Glu Thr He 

280 

Arg Asp Ser Cys 
2 95 

He Asn Glu Lys 
310 

Asn Val Pro Tyr 
325 

Glu Leu Thr Pro 

Leu Tyr Gly Gly 

360 

Thr Val Phe Thr 
375 

Lys Ala Val Glu 
390 

Phe Phe Trp Pro 
405 

Cys Tyr Ala Lys 

Gly Phe His Val 

440 

Gin Pro Leu Lys 
455 

Gly He His Pro 
470 

Lys Arg Ser Gly 
485 



170 

Ser Leu Pro Tyr 
185 

Val Ala Leu Glu 

Thr Val Met Val 

220 

Ala Asn Lys He 
235 

Arg Gin Phe Val 
250 

Gly Arg Leu Lys 
265 

Glu Asp Glu Phe 

Thr Arg Thr He 

300 

Thr Gly Lys He 
315 

He Tyr Ala He 
330 

Val Ala He Gin 
345 

Ser Thr Val Lys 

Pro Leu Glu Tyr 

380 

Lys Phe Gly Glu 
395 

Leu Glu Trp Thr 
410 

Val He Cys Asn 
425 

Leu Gly Pro Asn 

Cys Gly Leu Thr 

460 

Val Cys Ala Glu 
475 

Gly Asp He Leu 
490 



175 

Cys Pro Gly Lys 
190 

Cys Ala Gly Phe 
205 

Arg Ser He Leu 

Gly Glu His Met 

240 

Pro Thr Lys He 
255 

Val Thr Ala Lys 
270 

Asn Thr Val Leu 
285 

Gly Leu Glu Thr 

Pro Val Thr Asp 

320 

Gly Asp He Leu 
335 

Ala Gly Arg Leu 
350 

Cys Asp Tyr Asp 
365 

Gly Cys Cys Gly 

Glu Asn lie Glu 

400 

Val Pro Ser Arg 
415 

Leu Lys Asp Asn 
430 

Ala Gly Glu Val 
445 

Lys Gin Gin Leu 

He Phe Thr Thr 

480 

Gin Ser Gly Cys 
495 



<210> 310 
<211> 11 
<212> PRT 

<213> Rattus norvegicus 
<400> 310 

Met Asn Asp Ser Lys Asp Ala Pro Lys Ser Tyr 
1 5 10 



<210> 311 
<211> 496 
<212> PRT 

<213> Rattus norvegicus 
<400> 311 

Met Asn Asp Ser Lys Asp Ala Pro 

1 5 
He He Gly Gly Gly Ser Gly Gly 

20 

Lys Phe Asp Lys Lys Val Met Val 
35 40 



Lys Ser Tyr Asp Phe Asp Leu He 

10 15 
Leu Ala Ala Ala Lys Glu Ala Ala 
25 "* 30 

Leu Asp Phe Val Thr Pro Thr Pro 

45 
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Leu 


Gly 


Thr 


Asn 


Gly 


Gly 


Leu 


Gly Gly Thr 


Cys 


Val 


Asn 


Val 


Gly 


Cys 




50 










55 










60 










lie 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu 


Gly 


Gin 


Ala 


Leu 


65 










70 










75 










80 


Lys 


Asp 


Ser 


Arg 


Asn 


Tyr 


Gly 


Trp 


Lys 


Leu 


Glu 


Asp 


Thr 


Val 


Lys 


His 










85 










90 










95 




Asp 


Trp 


Glu 


Lys 


Met 


Thr 


Glu 


Ser 


Val 


Gin 


Asn 


His 


He 


Gly 


Ser 


Leu 








100 










105 










110 






Aan 


Trp 


Gly 


Tyr 


Arg 


Val 


Ala 


Leu 


Arg 


Glu 


Lys 


Lys 


Val 


Val 


Tyr 


Glu 






115 










120 










125 








Asn 


Ala 


Tyr 


Gly 


Lys 


Phe 


He 


Gly 


Pro 


His 


Lys 


He 


Met 


Ala 


Thr 


Asn 




130 










135 










140 










Asn 


Lys 


Gly 


Lys 


Glu 


Lys 


Val 


Tyr 


Ser 


Ala 


Glu 


Arg 


Phe 


Leu 


He 


Ala 


145 










150 










155 










160 


Thr Gly 


Glu Arg 


Pro 


Arg 


Tyr 


Leu 


Gly 


He 


Pro 


Gly 


Asp 

** 


Lys 


Glu 


Tyr 










165 










170 










175 




Cys 


lie 


Ser 


Ser 


Asp 


Asp 


Leu 


Phe 


Ser 


Leu 


Pro 


Tyr 


Cys 


Pro 


Gly 


Lys 








180 










185 










190 






Thr 


Leu 


Val 


Val 


Gly 


Ala 


Ser 


Tyr 


Val 


Ala 


Leu 


Glu 


Cys 


Ala 


Gly 


Phe 






195 










200 










205 








Leu 


Ala 


Gly 


lie 


Gly 


Leu 


Asp 


Val 


Thr 


Val 


Met 


Val 


Arg 


Ser 


He 


Leu 




210 










215 










220 










Leu 


Arg 


Gly 


Phe 


Asp 


Gin 


Asp 


Met 


Ala 


Asn 


Lys 


He 


Gly 


Glu 


His 


Met 


225 










230 










235 










240 


Glu 


Glu 


His 


Gly 


He 


Lys 


Phe 


He 


Arg 


Gin 


Phe 


Val 


Pro 


Thr 


Lys 


He 










245 










250 










255 




Glu 


Gin 


lie 


Glu 


Ala 


Gly 


Thr 


Pro 


Gly Arg 


Leu 


Lys 


Val 


Thr 


Ala 


Lys 








260 










265 










270 






Ser 


Thr 


Asn 


Ser 


Glu 


Glu 


Thr 


He 


Glu Asp 


Glu 


Phe 


Asn 


Thr 


Val 


Leu 






275 










280 










285 








Leu 


Ala 


Val 


Gly 


Arg 


Asp 


Ser 


Cys 


Thr Arg 


Thr 


He 


Gly 


Leu 


Glu 


Thr 




290 










295 










300 










Val 


Gly 


Val 


Lys 


He 


Asn 


Glu 


Lys 


Thr 


Gly 


Lys 


He 


Pro 


Val 


Thr 


Asp 


305 










310 










315 










320 


Glu 


Glu 


Gin 


Thr 


Asn 


Val 


Pro 


Tyr 


ZC 1 


Tyr 


Ala 


He 


Gly 


Asp 


He 


Leu 










325 










330 










335 




Glu 


Gly 


Lys 


Leu 


Glu 


Leu 


Thr 


Pro 


Val 


Ala 


He 


Gin 


Ala 


Gly 


Arg 


Leu 








340 










345 










350 






Leu 


Ala 


Gin 


Arg 


Leu 


Tyr 


Gly 


Gly 


Ser 


Thr 


Val 


Lys 


Cys 


Asp 


Tyr 


Asp 






355 










360 










365 








Asn 


Val 


Pro 


Thr 


Thr 


Val 


Phe 


Thr 


Pro 


Leu 


Glu 


Tyr 


Gly 


Cys 


Cys 


Gly 




370 










375 










380 










Leu 


Ser 


Glu 


Glu 


Lys 


Ala 


Val 


Glu 


Lys 


Phe 


Gly 


Glu 


Glu 


Asn 


He 


Glu 


385 








390 










395 










400 


Val 


Tyr 


His 


Ser 


Phe 


Phe 


Trp 


Pro 


Leu 


Glu 


Trp 


Thr 


Val 


Pro 


Ser 


Arg 








405 








410 










415 




Asp 


Asn 


Asn 


Lys 


Cys 


Tyr 


Ala 


Lys 


Val 


He 


Cys 


Asn 


Leu 


Lys 


Asp 


Asn 








420 










425 










430 






Glu 


Arg 


Val 


Val 


Gly 


Phe 


His 


Val 


Leu Gly 


Pro 


Asn 


Ala 


Gly 


Glu 


Val 






435 










440 










445 








Thr 


Gin 


Ala 


Leu 


Gin 


Pro 


Leu 


Lys 


Cys 


Gly 


Leu 


Thr 


Lys 


Gin 


Gin 


Leu 




450 










455 










460 










Asp 


Ser 


Thr 


He 


Gly 


He 


His 


Pro 


Val 


Cys 


Ala 


Glu 


He 


Phe 


Thr 


Thr 


465 








470 








475 










480 


Leu 


Ser 


Val 


Thr 


Lys 


Arg 


Ser 


Gly Gly Asp 


He 


Leu 


Gin 


Ser 


Gly 


Cys 










485 










490 










495 
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<400> 312 




Met 


Ala 


Ala 


He 


1 








Pro 


Gin 


Thr 


Arg 








20 


Ala 


Ser 


Ala 


Ala 






35 




Gly Gly Ser 


Gly 




50 






Arg 


Lys 


Val 


Ala 


65 








Lys 


Trp 


Gly 


Leu 


Lye 


Leu 


Met 


His 








100 


Gin 


His 


Tyr 


Gly 






115 




Ala 


Met 


Ala 


Glu 




130 






His 


Arg Val 


Gin 


145 








Ala 


Ser 


Phe 


Val 


Lys 


Val 


Thr 


Gin 








180 


Arg 


Pro 


Lys 


Tyr 






195 




Thr 


Ser 


Asp 


Asp 




210 






Val 


Val 


Gly 


Ala 


225 








Gly lie 


Gly 


Leu 


Gly Phe Asp 


Gin 








260 


His 


Gly Thr 


Arg 






275 




Leu 


Pro 


Thr 


Asn 




290 






Lys 


Glu 


Asp 


Val 


305 








Val 


Pro 


Glu 


Thr 


Asn 


Pro 


Lys 


Asn 








340 


Val 


Pro 


His 


He 






355 




Leu 


Thr 


Pro 


Thr 




370 






Phe 


Gly 


Lys 


Ser 


385 








Val 


Phe 


Thr 


Pro 


Ala 


Val 


Ala 


Leu 








420 


Tyr 


Lys 


Pro 


Leu 






435 




Tyr 


He 


Lys 


Met 




450 






Leu 


His 


Phe 


Leu 


465 








Leu 


Gly 


He 


Gin 


Gly lie 


His 


Pro 








500 


Lys 


Arg 


Ser 


Gly 






515 





Val Ala Ala Leu 
5 

Val Leu Thr Arg 

Gly Gly Gin Gin 

40 

Gly Leu Ala Cys 
55 

Val Ala Asp Tyr 
70 

Gly Gly Thr Cys 
85 

Gin Ala Ala Leu 

Trp Glu Val Ala 

120 

Ala Val Gin Asn 
135 

Leu Gin Asp Arg 
150 

Asn Glu His Thr 
165 

Leu Ser Ala Lys 

Pro Thr Gin Val 

200 

lie Phe Trp Leu 
215 

Ser Tyr Val Ala 
230 

Asp Thr Thr Val 
245 

Gin Met Ala Ser 

Phe Leu Lys Gly 

280 

Gin Leu Gin Val 
295 

Gly Thr Phe Asp 
310 

Arg Asn Leu Asn 
325 

Gin Lys He He 

Tyr Ala He Gly 

360 

Ala He Lys Ala 
375 

Ser Thr Leu Met 
390 

Leu Glu Tyr Gly 
405 

His Gly Gin Glu 

Glu Phe Thr Val 

440 

Val Cys Met Arg 
455 

Gly Pro Asn Ala 
470 

Cys Gly Ala Ser 
485 

Thr Cys Ser Glu 

Leu Asp Pro Thr 

520 



Arg Gly Ser Ser 
10 

Gly Thr Arg Gly 
25 

Asn Phe Asp Leu 

Ala Lys Glu Ala 

60 

Val Glu Pro Ser 
75 

Val Asn Val Gly 
90 

Leu Gly Gly Met 
105 

Gin Pro Val Gin 

Hi s Val Lys Ser 

140 

Lys Val Lys Tyr 
155 

Val His Gly Val 
170 

His He Val He 
185 

Lys Gly Ala Leu 

Lys Glu Ser Pro 

220 

Leu Glu Cys Ala 
235 

Met Met Arg Ser 
250 

Leu Val Thr Glu 
265 

Cys Val Pro Ser 

Thr Trp Glu Asp 

300 

Thr Val Leu Trp 
315 

Leu Glu Lys Ala 
330 

Val Asp Ala Gin 
345 

Asp Val Ala Glu 

Gly Lys Leu Leu 

380 

Asn Tyr Ser Asn 
3 95 

Cys Val Gly Leu 
410 

His He Glu Val 
425 

Ala Asp Arg Asp 

Glu Pro Pro Gin 

460 

Gly Glu Val Thr 
475 

Tyr Ala Gin Val 
490 

Glu Val Val Lys 
505 

Val Thr Gly Cys 



Gly Arg Phe Arg 
15 

Ala Ala Gly Ala 
30 

Leu Val He Gly 
45 

Ala Gin Leu Gly 

Pro Arg Gly Thr 

80 

Cys lie Pro Lys 
95 

He Arg Asp Ala 
110 

His Asn Trp Lys 
125 

Leu Asn Trp Gly 

Phe Asn He Lys 

160 

Asp Lys Ala Gly 
175 

Ala Thr Gly Gly 
190 

Glu Hi b Gly He 
205 

Gly Lys Thr Leu 

Gly Phe Leu Thr 

240 

Val Pro Leu Arg 
255 

His Met Glu Ser 
270 

Leu He Arg Lys 
285 

Leu Ala Ser Gly 

Ala He Gly Arg 

320 

Gly Val Asn Thr 
335 

Glu Ala Thr Ser 
350 

Gly Arg Pro Glu 
365 

Ala Gin Arg Leu 

Val Pro Thr Thr 

400 

Ser Glu Glu Glu 
415 

Tyr His Ala Tyr 
430 

Ala Ser Gin Cys 
445 

Leu Val Leu Gly 

* 

Gin Gly Phe Ala 

480 

Met Gin Thr Val 
495 

Leu His He Ser 

510 
Xaa Gly 
525 
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<210> 313 
<211> 499 
<212> PRT 
<213> Sub Scrofa 

<220> 

<221> VARIANT 
<222> 498 

<223> Xaa = Any Amino Acid 



<400> 313 



Met 


Asn Gly Ala 


Glu 


Glu 


Leu 


Pro 


Glu 


Met 


Tyr 


Asp 


Tyr Asp 


Leu 


I ±e 


1 








5 










10 








15 




He 


He 


Gly 


Gly Gly 


Ser 


Gly Gly 


Leu 


Ala 


Ala 


Ala 


Lys GJ.U 


A-La 


Axa 








20 










25 








30 






Arg 


Phe 


Asn 


Lys 


Arg 


Val 


Met 


Val 


Leu 


Asp 


Phe 


Val 


Tnr Pro 


Tnr 


fro 




35 






40 










45 






Leu 


Gly 


Thr 


Arg 


Trp 


Gly 


Leu Gly 


Gly 


Tnr 


Cys 


Val 


Asn Val 


Ser 


LyS 




50 










55 










60 








He 


Pro 


Lys 


Lys 


Leu 


Met 


His 


Gin 


Ala 


Ala 


Leu 


Leu 


Gly Gin 


Ala 


Leu 


65 






70 










75 








80 


Arg 


Asp 


Ser Arg Asn Tyr Gly Trp 


Asn 


Val 


Glu 


Glu 


Thr He 


Lys 


TT -J ~ 

rll S 






85 










90 








95 




Asp 


Trp 


Glu 


Arg 


Met 


Thr 


Glu 


Ala 


Val 


Gin 


Asn* His 


He Gly 


Ser 


Leu 




100 










105 








110 






Asn 


Trp Gly Tyr Arg Val 


Ala 


Leu 


Arg 


Glu 


Lys 


Lys 


Val Thr 


Tyr 


G±U 






115 










120 










125 






Asn 


Ala 


Tyr 


Gly Gin 


Phe 


Val 


Gly 


Pro 


His 


Arg 


He 


Lys Ala 


Thr 


Asn 




130 








135 










140 








Asn 


Lys 


Gly Lys 


Glu 


Lys 


He 


Tyr 


Ser 


Ala 


Glu 


Lys 


Phe Leu 


He 


Ala 


145 








150 










155 








160 


Thr 


Gly Glu 


Arg 


Pro 


Arg 


Tyr 


Leu 


Gly 


He 


Pro 


Gly 


Asp Lys 


Glu 


Tyr 










165 










170 








175 




Cys 


He 


Ser 


Ser 


Asp 


Asp 


Leu 


Phe 


Ser 


Leu 


Pro 


Tyr 


Cys Pro 


Gly 


Lys 






180 










185 








190 






Thr 


Leu 


Val 


Val 


Gly Ala 


Ser 


Tyr 


Val 


Ala 


Leu 


Glu 


Cys Ala 


Gly 


Phe 






195 










200 










205 






Leu 


Ala Gly 


He 


Gly 


Leu 


Asp 


Val 


Thr 


Val 


Met 


Val 


Arg Ser 


He 


Leu 




210 










215 










220 








Leu 


Arg Gly 


Phe 


Asp 


Gin 


Asp Met 


Ala 


Asn 


Lys 


He 


Gly Glu His 


Met 


225 










230 










235 








240 


Glu 


Glu 


His 


Gly 


He 
245 


Lys 


Phe 


He 


Arg 


Gin 
250 


Phe 


Val 


Pro He 


Lys 
255 


Val 


Glu 


Gin 


He 


Glu 


Ala 


Gly 


Thr 


Pro 


Gly Arg 


Leu 


Arg 


Val Val 


Ala 


G-Ln 








260 








265 








270 






Ser 


Thr 


Asn 
2 75 


Ser 


Glu 


Glu 


He 


He 
280 


Glu 


Gly 


Glu 


Tyr 


Asn Thr 
285 


Val 


Met 


Leu 


Ala 


He 


Gly Arg Asp 


Ala 


Cys 


Thr 


Arg 


Lys 


He 


Gly Leu 


Glu 


Thr 




290 










295 










300 








Val 


Gly Val 


Lys 


lie 


Asn 


Glu 


Lys 


Thr 


Gly 


Lys 


He 


Pro Val 


Thr 


Asp 


305 








310 










315 








320 


Glu 


Glu 


Gin 


Thr 


Asn 


Val 


Pro 


Tyr 


He 


Tyr 


Ala 


He 


Gly Asp 


He 


Leu 










325 








330 








335 




Glu 


Asp 


Lys 


Val 


Glu 


Leu 


Thr 


Pro 


Val 


Ala 


He 


Gin 


Ala Gly 


Arg 


Leu 




340 










345 








350 






Leu 


Ala 


Gin 


Arg 


Leu 


Tyr 


Ala Gly 


Ser 


Thr 


Val 


Lys 


Cys Asp 


Tyr 


Glu 






355 










360 










365 






Asn 


Val 
370 


Pro 


Thr 


Thr 


Val 


Phe 
375 


Thr 


Pro 


Leu 


Glu 


Tyr 
380 


Gly Ala 


Cys 


Gly 


Leu 


Ser 


Glu 


Glu 


Lys 


Ala 


Val 


Glu 


Lys 


Phe 


Gly 


Glu 


Glu Asn 


He 


Glu 


385 








390 








395 








400 


Val 


Tyr 


His 


Ser 


Tyr 


Phe 


Trp 


Pro 


Leu 


Glu 


Trp 


Thr 


He Pro 


Ser 


Arg 








405 








410 








415 




Asp 


Asn 


Asn 


Lys 


Cys 


Tyr 


Ala 


Lys 


He 


He 


Cys 


Asn 


Thr Lys 


Asp 


Asn 






420 










425 








430 






Glu 


Arg 


Val 


Val 


Gly Phe 


His 


Val 


Leu 


Gly 


Pro 


Asn 


Ala Gly 


Glu 


Val 
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435 

Thr Gin Gly Phe Ala 
450 

Leu Asp Ser Thr lie 
465 

Thr Leu Ser Val Thr 

485 

Cys Xaa Gly 



440 

Ala Ala Leu Lys Cys Gly 
455 

Gly He His Pro Val Cys 
470 475 
Lys Arg Ser Gly Ala Ser 

490 



445 

Leu Thr -Lys Lys Gin 
460 

Ala Glu Val Phe Thr 

480 

He Leu Gin Ala Gly 

495 
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